(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau 







(43) International Publication Date: 27 November 2003 (27.11.2003)
(10) International Publication Number: WO 03/097810 A2



(51) International Patent Classification 7 : C12N 

(21) International Application Number: PCT/US03/15712 

(22) International Filing Date: 15 May 2003 (15.05.2003) 



(25) Filing Language: English

(26) Publication Language: English



(30) Priority Data:

10/146,772  15 May 2002 (15.05.2002)  US
10/241,742  9 September 2002 (09.09.2002)  US



(63) Related by continuation (CON) or continuation-in-part 
(CIP) to earlier applications: 

US 10/241,742 (CIP) 

Filed on 9 September 2002 (09.09.2002) 

US 10/146,772 (CIP) 

Filed on 15 May 2002 (15.05.2002) 

(71) Applicant (for all designated States except US): DIVERSA
CORPORATION [US/US]; 4955 Directors
Place, San Diego, CA 92121 (US).

(72) Inventors; and 

(75) Inventors/Applicants (for US only): DESANTIS, Grace 

[CA/US]; 3520 Lebon Drive, Apt. 5223, San Diego, 
CA 92121 (US). SHORT, Jay, M. [US/US]; 681 Paseo 
Delicias, Box 7214, Rancho Santa Fe, CA 92067 (US). 
BURK, Mark [US/US]; 12634 Intermezzo Way, San 
Diego, CA 92130 (US). WONG, Kevin [US/US]; 11304
Trebol Street, San Diego, CA 92126 (US). FARWELL,
Robert [US/US]; 4415 Terreno Court, San Diego, CA 
92124 (US). CHATMAN, Kelly [US/US]; P.O. Box 429, 
Solana Beach, CA 92075 (US). 

(74) Agent: EINHORN, Gregory, P.; Fish & Richardson P.C., 
4350 La Jolla Village Drive, Suite 500, San Diego, CA 
92122 (US). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 

AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH, 
GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, 
MX, MZ, NI, NO, NZ, OM, PH, PL, PT, RO, RU, SC, SD, 
SE, SG, SK, SL, TJ, TM, TN, TR, TT, TZ, UA, UG, US, 
UZ, VC, VN, YU, ZA, ZM, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, BG, CH, CY, CZ, DE, DK, EE, 
ES, FI, FR, GB, GR, HU, IE, IT, LU, MC, NL, PT, RO,
SE, SI, SK, TR), OAPI patent (BF, BJ, CF, CG, CI, CM, 
GA, GN, GQ, GW, ML, MR, NE, SN, TD, TG). 

Published: 

— without international search report and to be republished 
upon receipt of that report 

For two-letter codes and other abbreviations, refer to the "Guidance
Notes on Codes and Abbreviations" appearing at the beginning
of each regular issue of the PCT Gazette.




(54) Title: NITRILASES, NUCLEIC ACIDS ENCODING THEM AND METHODS FOR MAKING AND USING THEM 

(57) Abstract: The invention relates to nitrilases and to nucleic acids encoding the nitrilases. In addition, methods of designing
new nitrilases and methods of use thereof are also provided. The nitrilases have increased activity and stability at increased pH and
temperature. 



WO 03/097810 PCT/US03/15712 

NITRILASES, NUCLEIC ACIDS ENCODING THEM AND 
METHODS FOR MAKING AND USING THEM 



CROSS-REFERENCES TO RELATED APPLICATIONS 

[0001] This application claims the benefit of priority to U.S. Patent Application Serial
No. (USSN) 10/241,742, filed September 9, 2002, and USSN 10/146,772, filed May 15,
2002, which claims the benefit of priority to USSN 60/351,336, filed January 22, 2002,
USSN 60/309,006, filed July 30, 2001, and USSN 60/300,189, filed June 21, 2001; and is a
continuation-in-part of USSN 09/751,299, filed December 28, 2000, which claims the benefit
of priority to each of USSN 60/254,414, filed December 7, 2000, and USSN 60/173,609,
filed December 29, 1999. These applications are hereby incorporated by reference into the
subject application in their entireties for all purposes. 

COPYRIGHT NOTIFICATION 

[0002] Pursuant to 37 C.F.R. §1.71(e), a portion of this patent document contains 
material which is subject to copyright protection. The copyright owner has no objection to 
the facsimile reproduction by anyone of the patent document or the patent disclosure, as it
appears in the Patent and Trademark Office patent file or records, but otherwise reserves all 
copyright rights whatsoever. 

FIELD OF THE INVENTION 

[0003] The invention relates generally to the field of molecular biology, biochemistry and 
chemistry, and particularly to enzymatic proteins having nitrilase activity. The invention also
relates to polynucleotides encoding the enzymes, and to uses of such polynucleotides and 
enzymes. 

BACKGROUND OF THE INVENTION 

[0004] There are naturally occurring enzymes which have great potential for use in 
industrial chemical processes for the conversion of nitriles to a wide range of useful products
and intermediates. Such enzymes include nitrilases which are capable of converting nitriles 
directly to carboxylic acids. Nitrilase enzymes are found in a wide range of mesophilic 
micro-organisms, including species of Bacillus, Nocardia, Bacteridium, Rhodococcus,
Micrococcus, Brevibacterium, Alcaligenes, Acinetobacter, Corynebacterium, Fusarium and
Klebsiella. Additionally, there are thermophilic nitrilases which exist in bacteria.
[0005] There are two major routes from a nitrile to an analogous acid: (1) a nitrilase
catalyzes the direct hydrolysis of a nitrile to a carboxylic acid with the concomitant release of
ammonia; or (2) a nitrile hydratase adds a molecule of water across the carbon-nitrogen
bonding system to give the corresponding amide, which can then act as a substrate for an
amidase enzyme which hydrolyzes the carbon-nitrogen bond to give the carboxylic acid
product with the concomitant release of ammonia. The nitrilase enzyme therefore provides
the more direct route to the acid.
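The two routes in paragraph [0005] can be written as balanced reactions. This is an illustrative summary only; R stands for an arbitrary organic substituent, a notation introduced here rather than taken from the application.

```latex
\begin{align*}
\text{(1) nitrilase:} \quad
  & \mathrm{R{-}C{\equiv}N} + 2\,\mathrm{H_2O}
    \longrightarrow \mathrm{R{-}COOH} + \mathrm{NH_3} \\
\text{(2a) nitrile hydratase:} \quad
  & \mathrm{R{-}C{\equiv}N} + \mathrm{H_2O}
    \longrightarrow \mathrm{R{-}CONH_2} \\
\text{(2b) amidase:} \quad
  & \mathrm{R{-}CONH_2} + \mathrm{H_2O}
    \longrightarrow \mathrm{R{-}COOH} + \mathrm{NH_3}
\end{align*}
```

Adding (2a) and (2b) gives the same net reaction as (1), which is why the nitrilase is described as the more direct route.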

[0006] A nitrile group offers many advantages in devising synthetic routes in that it is 
often easily introduced into a molecular structure and can be carried through many processes 
as a masked acid or amide group. This is only of use, however, if the nitrile can be unmasked 
at the relevant step in the synthesis. Cyanide represents a widely applicable C1-synthon
(cyanide is one of the few water-stable carbanions) which can be employed for the synthesis
of a carbon framework. However, further transformations of the nitrile thus obtained are 
impeded due to the harsh reaction conditions required for its hydrolysis using normal 
chemical synthesis procedures. The use of enzymes to catalyze the reactions of nitriles is 
attractive because nitrilase enzymes are able to effect reactions with fewer environmentally 

hazardous reagents and by-products than in many traditional chemical methods. Indeed, the
chemoselective biocatalytic hydrolysis of nitriles represents a valuable alternative because it 
occurs at ambient temperature and near physiological pH. 

[0007] The importance of asymmetric organic synthesis in drug design and discovery has 
fueled the search for new synthetic methods and chiral precursors which can be utilized in
developing complex molecules of biological interest. One important class of chiral
molecules is the α-substituted carboxylic acids, which include the α-amino acids. These
molecules have long been recognized as important chiral precursors to a wide variety of
complex biologically active molecules, and a great deal of research effort has been dedicated
to the development of methods for the synthesis of enantiomerically pure α-amino acids and
chiral medicines.

[0008] Of particular use to synthetic chemists who make chiral medicines would be an
enzyme system which is useful under non-sterile conditions; which is useful in non-biological
laboratories; which is available in a form convenient for storage and use; which has broad
substrate specificity; which acts on poorly water soluble substrates; which has predictable
product structure; which provides a choice of acid or amide product; and which is capable of
chiral differentiation. Accordingly, there is a need for efficient, inexpensive, high-yield
synthetic methods for producing enantiomerically pure α-substituted carboxylic acids, such as,
for example, α-amino acids and α-hydroxy acids.

[0009] In addition, the discovery or evolution of an enzyme to perform a particular
transformation can often be aided by the availability of a convenient high throughput screening or
selection process. While a surrogate substrate may be used when an effective ultra high
throughput (UHTP) screen is not available, it may be desirable to screen directly for an enzyme
that performs specifically the desired transformation. The challenge of designing a UHTP
screen is evident when, for example, the discovery or evolution program is aimed at uncovering
a stereoselective transformation to generate only one stereoisomer or enantiomer. In this case,
there is a paucity of high throughput screening methods available. While the most
straightforward method is to use chiral liquid or gas phase separation to separate the two
enantiomers in question, often this approach does not afford the very high throughput capacity
that is required. By using mass spectroscopy (MS) techniques, very high throughput screens are
possible. However, when applied in a conventional manner, MS does not afford information on
chirality or enantioselectivity.

[00010] Another approach is to chemically derivatize the enantiomeric mixture with a single
enantiomer compound, thus generating a diastereomeric mixture of compounds that can be
characterized by separation on an achiral stationary phase. Again, this is a cumbersome
approach and does not lend itself well to high throughput screening.

[00011] Throughout this application, various publications are referenced by author and 
date. The disclosures of these publications in their entireties are hereby incorporated by 
reference into this application in order to more fully describe the state of the art as known to
those skilled therein as of the date of the invention described and claimed herein. 

SUMMARY OF THE INVENTION 

[00012] The present invention is directed to an isolated or recombinant nucleic acid 

comprising nucleotides having a sequence at least about 50% identical to SEQ ID NO: 1, 3,
5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57,
59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 
107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 
143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 
179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213,
215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 
251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 
287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 
323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357,
359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, or variants thereof, 
wherein the nucleic acid encodes a polypeptide having a nitrilase activity. In alternative 
aspects of the invention, the nucleic acid comprises nucleotides having a sequence at least 
about 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61%, 62%, 63%, 64%, 

65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%,
81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 
97%, 98%, 99%, or more, or complete identity (100% identical) to the SEQ ID NO: or 
variants thereof. Exemplary variants may include, for example, the following variations of 
SEQ ID NO:195, 205, 207, 209, or 237, having one or more mutations: at positions
163-165 AAA, AAG, GGT, GGC, GGA, GGG, CAA, or CAG; at positions 178-180 GAA or
GAG; at positions 331-333 TCT, TCC, TCA, TCG, AGT, or AGC; at positions 568-570 
CAT, CAC, TCT, TCC, TCA, TCG, AGT, AGC, ACT, ACC, ACA, TCA, TAT, TAG, ATG 
or ACG; at positions 571-573 TTA, TTG, CTT, CTC, CTA, CTG, GTT, GTC, GTA, GTG, 
ATG, ACT, ACC, ACA, GAT, GAC, GGT, GGC, GGA, GGG, GAA, GAG, TAT, TAC, or 

ACG; at positions 595-597 GAA, GAG, TTA, TTG, CTT, CTC, CTA, or CTG; at positions
664-666 TTA, TTG, CTT, CTC, CTA, or CTG; or any combination thereof. In one aspect of 
the invention, the variants encode a polypeptide having improved or diminished 
enantioselectivity, for example, in the conversion of a 3-hydroxyglutarylnitrile (HGN) to
(R)-4-Cyano-3-hydroxybutyrate, than the polypeptide encoded by the SEQ ID NO.
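The nucleotide positions listed in paragraph [00012] line up with the residue numbers given later for the corresponding polypeptide variants (for example, positions 163-165 and residue 55). The following is a minimal sketch of that arithmetic, assuming the coding sequence starts at nucleotide 1 of the SEQ ID NO and is read in frame. The function names and the partial codon table are illustrative and do not come from the application.

```python
# Illustrative sketch: map the 1-based nucleotide positions of one codon to
# the 1-based residue number it encodes, and translate a few codons using a
# partial copy of the standard genetic code.

# Partial standard genetic code; only a few amino acids are included here.
CODON_TABLE = {
    "AAA": "K", "AAG": "K",                          # lysine
    "GGT": "G", "GGC": "G", "GGA": "G", "GGG": "G",  # glycine
    "CAA": "Q", "CAG": "Q",                          # glutamine
    "GAA": "E", "GAG": "E",                          # glutamic acid
    "TTA": "L", "TTG": "L", "CTT": "L",
    "CTC": "L", "CTA": "L", "CTG": "L",              # leucine
}


def residue_number(first_nt: int, last_nt: int) -> int:
    """Return the residue encoded by nucleotides first_nt..last_nt (1-based).

    Assumes the reading frame starts at nucleotide 1, so residue n is
    encoded by nucleotides 3n-2 through 3n.
    """
    if last_nt - first_nt != 2 or first_nt % 3 != 1:
        raise ValueError("positions do not span one in-frame codon")
    return last_nt // 3


def translate_codon(codon: str) -> str:
    """Translate one DNA codon to a one-letter amino acid code."""
    return CODON_TABLE[codon.upper()]


if __name__ == "__main__":
    for first, last in [(163, 165), (178, 180), (331, 333), (568, 570),
                        (571, 573), (595, 597), (664, 666)]:
        print(f"positions {first}-{last} -> residue {residue_number(first, last)}")
```

Under this assumption the seven codon ranges map to residues 55, 60, 111, 190, 191, 199 and 222, the same residues named for the polypeptide variants.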

[00013] In one aspect of the invention, the nucleic acid comprises nucleotides having a
sequence substantially identical to the SEQ ID NO: or variants thereof. In another aspect, the
invention provides for an isolated or recombinant nucleic acid comprising consecutive
nucleotides having a sequence at least 79% identical to SEQ ID NO: 33, wherein the nucleic
acid encodes a polypeptide having nitrilase activity. The invention provides for a fragment of
the nucleic acid, wherein the fragment encodes a polypeptide having nitrilase activity. The
invention also provides for an isolated or recombinant nucleic acid complementary to any of
the nucleic acids. The invention also provides for an isolated or recombinant nucleic acid
that hybridizes to any one of the nucleic acids under stringent conditions. In one aspect, the
stringent conditions comprise at least 50% formamide, and about 37°C to about 42°C.
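As an illustration of what a percent-identity threshold such as "at least 79% identical" means in practice, here is a simplified sketch. It assumes the two sequences have already been aligned to equal length with "-" as the gap character, and it counts identical columns over all alignment columns. That counting convention is an assumption made for this example and is not necessarily the comparison method the application relies on.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of alignment columns in which both sequences carry the same residue.

    Both inputs must be pre-aligned and of equal length. Columns where both
    sequences have a gap are ignored; a gap opposite a residue is a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if not (x == gap and y == gap)
    ]
    if not columns:
        raise ValueError("no comparable columns")
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a: str, aligned_b: str, minimum: float) -> bool:
    """True if the two aligned sequences are at least `minimum` percent identical."""
    return percent_identity(aligned_a, aligned_b) >= minimum


if __name__ == "__main__":
    print(percent_identity("ACGTACGTAC", "ACGTTCGTAC"))  # one mismatch in ten columns
```

Other conventions, such as dividing by the length of the shorter sequence, give different numbers, so a threshold is only meaningful together with the stated method.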
[00014] The invention provides for a nucleic acid probe comprising from about 15 
nucleotides to about 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 
150, 200, 250, 300, 350, 400, 450, 500 or more nucleotides, wherein at least 10, 11, 12, 13,
14, 15, 16, 17, 18, 19, 20 or more consecutive nucleotides are at least 50% complementary to 
a nucleic acid target region within a nucleic acid sequence of SEQ ID NO: 1, 3, 5, 7, 9, 11, 
13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 
65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 

111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145,
147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 
183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 
219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 
255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 

291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 325,
327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 359, 361, 
363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, variants thereof, or their 
complements. In one aspect, the nucleic acid probe comprises consecutive nucleotides which 
are at least 55% complementary to the nucleic acid target region. In one aspect, the invention
provides for a nucleic acid probe, wherein the consecutive nucleotides are at least 50%, 51%,
52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 
68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 
84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 
or more or 100% complementary to the nucleic acid target region. In another aspect, the 

nucleic acid consists essentially of from about 20 to about 50 nucleotides. In other aspects,
the nucleic acid can be at least about 20, 25, 30, 35, 40, 45, 50, 75, 100, 150 nucleotides in 
length. 
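The probe language in paragraph [00014] turns on how complementary a run of consecutive nucleotides is to a target region. The sketch below shows one simple way to score that: it pairs the probe against an equal-length target window in antiparallel orientation and counts Watson-Crick pairs. The function names and the scoring convention are assumptions made for illustration.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def percent_complementarity(probe: str, target_region: str) -> float:
    """Percent of probe bases that pair with an equal-length target region.

    Both strands are written 5'->3'; the probe is compared with the reverse
    complement of the target region, i.e. in antiparallel orientation.
    """
    if len(probe) != len(target_region) or not probe:
        raise ValueError("probe and target region must be the same non-zero length")
    partner = reverse_complement(target_region)
    pairs = sum(1 for p, q in zip(probe.upper(), partner) if p == q)
    return 100.0 * pairs / len(probe)


def best_window(probe: str, target: str) -> tuple[float, int]:
    """Slide the probe along the target; return (best percent, 0-based start)."""
    n = len(probe)
    if n == 0 or n > len(target):
        raise ValueError("probe must be non-empty and no longer than the target")
    best_score, best_start = -1.0, -1
    for start in range(len(target) - n + 1):
        score = percent_complementarity(probe, target[start:start + n])
        if score > best_score:  # keep the first window with the best score
            best_score, best_start = score, start
    return best_score, best_start


if __name__ == "__main__":
    print(best_window("GCAT", "TTTTATGCTTTT"))
```

A probe of real length would be scored the same way; only the window size changes.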

[00015] The invention provides for a nucleic acid vector capable of replication in a host
cell, wherein the vector comprises the nucleic acid of the invention. The invention also
provides for a host cell comprising the nucleic acid. The invention also provides for a host
organism comprising the host cell. In one aspect, the host organism comprises a gram
negative bacterium, a gram positive bacterium or a eukaryotic organism. In another aspect,
the gram negative bacterium comprises Escherichia coli, or Pseudomonas fluorescens. In a
further aspect, the gram positive bacterium comprises Streptomyces diversa, Lactobacillus
gasseri, Lactococcus lactis, Lactococcus cremoris, or Bacillus subtilis. In a further aspect,
the eukaryotic organism comprises Saccharomyces cerevisiae, Schizosaccharomyces pombe,
Pichia pastoris, Kluyveromyces lactis, Hansenula polymorpha, or Aspergillus niger.
[00016] The invention provides for an isolated or recombinant nucleic acid encoding a
polypeptide comprising amino acids having a sequence at least 50% identical to SEQ ID 
NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 
52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 
100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 

136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170,
172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 
208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 
244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 
280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 

316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350,
352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, or 
variants thereof, wherein the polypeptide has nitrilase activity. In one aspect, the polypeptide 
comprises amino acids having at least about 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 
58%, 59%, 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 

74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%,
90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more or 100% identity to the 
SEQ ID NO: or variants thereof. Exemplary variants may include, for example, the 
following variations of SEQ ID NO: 196, 206, 208, 210 or 238, having one or more 
mutations: at residue 55 lysine, glycine, or glutamine; at residue 60 glutamic acid; at residue
111 serine; at residue 190 serine, histidine, tyrosine or threonine; at residue 191 leucine,
valine, methionine, aspartic acid, glycine, glutamic acid, tyrosine or threonine; at residue 199
glutamic acid or leucine; at residue 222 leucine; or any combination thereof.
[00017] The invention also provides for an isolated or recombinant nucleic acid encoding a 
polypeptide comprising at least 10 consecutive amino acids having a sequence identical to a 

portion of an amino acid sequence of SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26,
28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 
78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 
120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 
156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190,
192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 
228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 
264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 
300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334,
336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 
372, 374, 376, 378, 380, 382, 384, 386, or variants thereof. 

[00018] An isolated or recombinant polypeptide comprising amino acids having a 
sequence at least about 50% identical to SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 

26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74,
76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 
118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 
154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 
190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 

226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260,
262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 
298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 
334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 
370, 372, 374, 376, 378, 380, 382, 384, 386, or variants thereof, wherein the polypeptide has 

nitrilase activity. In one aspect of the invention, the polypeptide comprises amino acids
having a sequence at least about 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 
60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 
76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more or 100% identical to the SEQ ID NO: 

or variants thereof.

[00019] The invention provides an isolated or recombinant nucleic acid comprising 
nucleotides having a sequence as set forth in any one of the following SEQ ID NOS: 1, 3, 5, 7,
9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 
61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 

107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141,
143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 
179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 
215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 
251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285,
287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 
323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 
359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, and variants thereof 
(hereinafter referred to as "Group A nucleic acids"). The invention is also directed to nucleic 
acids having specified minimum percentages of sequence identity to any of the Group A 
nucleic acids sequences. 

[00020] In another aspect, the invention provides an isolated (purified) or recombinant 
polypeptide comprising amino acid residues having a sequence as set forth in any one of the 
following SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 
42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 
92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 
130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 
166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 
202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 
238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 
274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 
310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 
346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 
382, 384, 386, and variants thereof, (hereinafter referred to as "Group B amino acid 
sequences"). The invention is also directed to purified polypeptides having specified 
minimum percentages of sequence identity to any of the Group B amino acid sequences. 
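The long SEQ ID NO lists that define Group A and Group B follow a regular pattern: Group A is the odd numbers from 1 to 385 with 33 omitted (SEQ ID NO: 33 is addressed separately in paragraph [00013]), and Group B is the even numbers from 2 to 386. The sketch below reproduces the two lists programmatically. The pairing of each odd-numbered nucleic acid with the next even-numbered polypeptide is an assumption inferred from the exemplary variants (195/196, 205/206, 207/208, 209/210, 237/238), and the helper name is hypothetical.

```python
# Group A nucleic acids: odd SEQ ID NOs 1..385, with 33 omitted as in the text.
GROUP_A = [n for n in range(1, 386, 2) if n != 33]

# Group B amino acid sequences: even SEQ ID NOs 2..386.
GROUP_B = list(range(2, 387, 2))


def encoded_polypeptide(nucleic_acid_seq_id: int) -> int:
    """Return the SEQ ID NO of the polypeptide assumed to be encoded.

    Assumption for illustration: nucleic acid SEQ ID NO n (odd) encodes the
    polypeptide of SEQ ID NO n + 1.
    """
    if nucleic_acid_seq_id % 2 != 1 or not 1 <= nucleic_acid_seq_id <= 385:
        raise ValueError("expected an odd SEQ ID NO between 1 and 385")
    return nucleic_acid_seq_id + 1


if __name__ == "__main__":
    print(len(GROUP_A), "Group A entries;", len(GROUP_B), "Group B entries")
    print([encoded_polypeptide(n) for n in (195, 205, 207, 209, 237)])
```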
[00021] The invention provides for a fragment of the polypeptide which is at least 50 
amino acids in length, and wherein the fragment has nitrilase activity. Furthermore, the 
invention provides for a peptidomimetic of the polypeptide or a fragment thereof having 
nitrilase activity. The invention provides for a codon-optimized polypeptide or a fragment 
thereof, having nitrilase activity, wherein the codon usage is optimized for a particular 
organism or cell. Narum et al. Infect. Immun. 2001 Dec, 69(12):7250-3 describes codon-optimization
in the mouse system. Outchkourov et al. Protein Expr. Purif. 2002 Feb;
24(1):18-24 describes codon-optimization in the yeast system. Feng et al. Biochemistry 2000
Dec 19, 39(50):15399-409 describes codon-optimization in E. coli. Humphreys et al. Protein 
Expr. Purif. 2000 Nov, 20(2):252-64 describes how codon usage affects secretion in E. coli. 
[00022] In one aspect, the organism or cell comprises a gram negative bacterium, a gram
positive bacterium or a eukaryotic organism. In another aspect of the invention, the gram
negative bacterium comprises Escherichia coli, or Pseudomonas fluorescens. In another
aspect of the invention, the gram positive bacterium comprises Streptomyces diversa,
Lactobacillus gasseri, Lactococcus lactis, Lactococcus cremoris, or Bacillus subtilis. In
another aspect of the invention, the eukaryotic organism comprises Saccharomyces
cerevisiae, Schizosaccharomyces pombe, Pichia pastoris, Kluyveromyces lactis, Hansenula
polymorpha, or Aspergillus niger.

[00023] In another aspect, the invention provides for a purified antibody that specifically
binds to the polypeptide of the invention or a fragment thereof, having nitrilase activity. In
one aspect, the invention provides for a fragment of the antibody that specifically binds to a
polypeptide having nitrilase activity.

[00024] The invention provides for an enzyme preparation which comprises at least one of
the polypeptides of the invention, wherein the preparation is liquid or dry. The enzyme
preparation includes a buffer, cofactor, or second or additional protein. In one aspect the
preparation is affixed to a solid support. In one aspect of the invention, the solid support can
be a gel, a resin, a polymer, a ceramic, a glass, a microelectrode and any combination thereof.
In another aspect, the preparation can be encapsulated in a gel or a bead.
[00025] The invention further provides for a composition which comprises at least one 

nucleic acid of the invention which comprises at least one polypeptide of the invention or a
fragment thereof, or a peptidomimetic thereof, having nitrilase activity, or any combination 
thereof. 

[00026] The invention provides for a method for hydrolyzing a nitrile to a carboxylic acid 
comprising contacting the molecule with at least one polypeptide of the invention or a 
fragment thereof, or a peptidomimetic thereof, having nitrilase activity, under conditions

suitable for nitrilase activity. In one aspect, the conditions comprise aqueous conditions. In 
another aspect, the conditions comprise a pH of about 8.0 and/or a temperature from about 
37° C to about 45° C. 

[00027] The invention provides for a method for hydrolyzing a cyanohydrin moiety or an 
aminonitrile moiety of a molecule, the method comprising contacting the molecule with at
least one polypeptide of the invention, or a fragment thereof, or a peptidomimetic thereof, 
having nitrilase activity, under conditions suitable for nitrilase activity. 
[00028] The invention provides for a method for making a chiral α-hydroxy acid
molecule, a chiral amino acid molecule, a chiral β-hydroxy acid molecule, or a chiral gamma-hydroxy
acid molecule, the method comprising admixing a molecule having a cyanohydrin
moiety or an aminonitrile moiety with at least one polypeptide having an amino acid
sequence at least 50% identical to any one of the Group B amino acid sequences or a
fragment thereof, or a peptidomimetic thereof, having enantio-selective nitrilase activity. In
one aspect, the chiral molecule is an (R)-enantiomer. In another aspect, the chiral molecule is
an (S)-enantiomer. In one aspect of the invention, one particular enzyme can have R-specificity
for one particular substrate and the same enzyme can have S-specificity for a
different particular substrate.

[00029] The invention also provides for a method for making a composition or an 
intermediate thereof, the method comprising admixing a precursor of the composition or 
intermediate, wherein the precursor comprises a cyanohydrin moiety or an aminonitrile 
moiety, with at least one polypeptide of the invention or a fragment or peptidomimetic 

thereof having nitrilase activity, hydrolyzing the cyanohydrin or the aminonitrile moiety in
the precursor thereby making the composition or the intermediate thereof. In one aspect, the 
composition or intermediate thereof comprises (^-2-amino-4-phenyl butanoic acid. In a 
further aspect, the composition or intermediate thereof comprises an L-amino acid. In a
further aspect, the composition comprises a food additive or a pharmaceutical drug. 

[00030] The invention provides for a method for making an (R)-ethyl 4-cyano-3-hydroxybutyric
acid, the method comprising contacting a hydroxyglutaryl nitrile with at least
one polypeptide having an amino acid sequence of the Group B amino acid sequences, or a
fragment or peptidomimetic thereof having nitrilase activity that selectively produces an
(R)-enantiomer, so as to make (R)-ethyl 4-cyano-3-hydroxybutyric acid. In one aspect, the
enantiomeric excess (ee) is at least 95% or at least 99%. In another aspect, the hydroxyglutaryl
nitrile comprises 1,3-di-cyano-2-hydroxy-propane or 3-hydroxyglutaronitrile. In a further aspect,
the polypeptide has an amino acid sequence of any one of the Group B amino acid sequences,
or a fragment or peptidomimetic thereof having nitrilase activity.
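For readers less familiar with the ee figure quoted in paragraph [00030], enantiomeric excess is conventionally the absolute difference between the amounts of the two enantiomers divided by their sum, expressed as a percentage. A minimal sketch follows; the function name and the example amounts are illustrative and are not data from the application.

```python
def ee_percent(r_amount: float, s_amount: float) -> float:
    """Enantiomeric excess in percent from the amounts of the two enantiomers.

    The amounts may be moles, peak areas, or mole fractions, as long as both
    use the same unit.
    """
    if r_amount < 0 or s_amount < 0:
        raise ValueError("amounts must be non-negative")
    total = r_amount + s_amount
    if total == 0:
        raise ValueError("at least one enantiomer must be present")
    return 100.0 * abs(r_amount - s_amount) / total


if __name__ == "__main__":
    # Illustrative mixture: 97.5 parts (R) to 2.5 parts (S).
    print(ee_percent(97.5, 2.5))
```

On this definition, an ee of at least 95% corresponds to a mixture containing at least 97.5% of the major enantiomer, and an ee of at least 99% to at least 99.5%.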

[00031] The invention also provides a method for making an (S)-ethyl 4-cyano-3-hydroxybutyric
acid, the method comprising contacting a hydroxyglutaryl nitrile with at least
one polypeptide having an amino acid sequence of the Group B amino acid sequences, or a
fragment or peptidomimetic thereof having nitrilase activity that selectively produces an
(S)-enantiomer, so as to make (S)-ethyl 4-cyano-3-hydroxybutyric acid.

[00032] The invention provides a method for making an (R)-mandelic acid, the method
comprising admixing a mandelonitrile with at least one polypeptide having an amino acid
sequence of any one of the Group B amino acid sequences or any fragment or
peptidomimetic thereof having appropriate nitrilase activity. In one aspect, the (R)-mandelic
acid comprises (R)-2-chloromandelic acid. In another aspect, the (R)-mandelic acid
comprises an aromatic ring substitution in the ortho-, meta-, or para- positions; a 1-naphthyl
derivative of (R)-mandelic acid, a pyridyl derivative of (R)-mandelic acid or a thienyl
derivative of (R)-mandelic acid or a combination thereof.

[00033] The invention provides a method for making an (S)-mandelic acid, the method 
comprising admixing a mandelonitrile with at least one polypeptide having an amino acid 
sequence of Group B sequences or any fragment or peptidomimetic thereof having nitrilase 
activity. In one aspect, the (S)-mandelic acid comprises (S)-methyl benzyl cyanide and the 
mandelonitrile comprises (S)-methoxy-benzyl cyanide. In one aspect, the (S)-mandelic acid 
comprises an aromatic ring substitution in the ortho-, meta-, or para- positions; a 1-naphthyl 
derivative of (S)-mandelic acid, a pyridyl derivative of (S)-mandelic acid or a thienyl 
derivative of (S)-mandelic acid or a combination thereof. 

[00034] The invention also provides a method for making an (S)-phenyl lactic acid 
derivative or an (R)-phenyllactic acid derivative, the method comprising admixing a 
phenyllactonitrile with at least one polypeptide selected from the group of the Group B amino 
acid sequences or any active fragment or peptidomimetic thereof that selectively produces an 
(S)-enantiomer or an (R)-enantiomer, thereby producing an (S)-phenyl lactic acid derivative 
or an (R)-phenyl lactic acid derivative. 

[00035] The invention provides for a method for making the polypeptide of the invention 
or a fragment thereof, the method comprising (a) introducing a nucleic acid encoding the 
polypeptide into a host cell under conditions that permit production of the polypeptide by the 
host cell, and (b) recovering the polypeptide so produced. 

[00036] The invention provides for a method for generating a nucleic acid variant 
encoding a polypeptide having nitrilase activity, wherein the variant has an altered biological 
activity from that which naturally occurs, the method comprising (a) modifying the nucleic 
acid by (i) substituting one or more nucleotides for a different nucleotide, wherein the 
nucleotide comprises a natural or non-natural nucleotide; (ii) deleting one or more 
nucleotides, (iii) adding one or more nucleotides, or (iv) any combination thereof. In one 
aspect, the non-natural nucleotide comprises inosine. In another aspect, the method further 
comprises assaying the polypeptides encoded by the modified nucleic acids for altered 
nitrilase activity, thereby identifying the modified nucleic acid(s) encoding a polypeptide 
having altered nitrilase activity. In one aspect, the modifications of step (a) are made by 
PCR, error-prone PCR, shuffling, oligonucleotide-directed mutagenesis, assembly PCR, 
sexual PCR mutagenesis, in vivo mutagenesis, cassette mutagenesis, recursive ensemble 
mutagenesis, exponential ensemble mutagenesis, site-specific mutagenesis, gene reassembly, 
gene site saturated mutagenesis, ligase chain reaction, in vitro mutagenesis, 
oligonucleotide synthesis, any DNA-generating technique and any combination 
thereof. In another aspect, the method further comprises at least one repetition of the 
modification step (a). 
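
The modification step (a) described above (substituting, deleting or adding nucleotides, optionally repeated) can be pictured with a small in silico sketch. This is only an illustration under stated assumptions, not the laboratory method: the function names `mutate` and `make_variants`, the restriction to the four natural DNA bases, and the use of a seeded random generator are all hypothetical choices made for the example.

```python
import random

NUCLEOTIDES = "ACGT"  # natural DNA bases only; non-natural bases are not modelled


def mutate(seq, rng):
    """Apply one random substitution, deletion, or addition to seq."""
    operations = ["substitute", "add"]
    if len(seq) > 1:
        operations.append("delete")  # never delete the last remaining base
    op = rng.choice(operations)
    pos = rng.randrange(len(seq))
    if op == "substitute":
        # Replace the base at pos with a different base.
        choices = [n for n in NUCLEOTIDES if n != seq[pos]]
        return seq[:pos] + rng.choice(choices) + seq[pos + 1:]
    if op == "delete":
        return seq[:pos] + seq[pos + 1:]
    # "add": insert one base immediately before pos.
    return seq[:pos] + rng.choice(NUCLEOTIDES) + seq[pos:]


def make_variants(seq, n, rounds=1, seed=0):
    """Return n variants, each obtained by `rounds` repetitions of the
    modification step applied to the parent sequence."""
    rng = random.Random(seed)
    variants = []
    for _ in range(n):
        variant = seq
        for _ in range(rounds):
            variant = mutate(variant, rng)
        variants.append(variant)
    return variants


if __name__ == "__main__":
    for v in make_variants("ATGGCATTCGGA", n=5, rounds=2, seed=42):
        print(v)
```

Each variant produced this way would still have to be expressed and assayed for altered nitrilase activity, as the paragraph above describes; the sketch covers only the sequence-modification step.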

[00037] The invention further provides a method for making a polynucleotide from two or 
more nucleic acids, the method comprising: (a) identifying regions of identity and regions of 
diversity between two or more nucleic acids, wherein at least one of the nucleic acids 
comprises a nucleic acid of the invention; (b) providing a set of oligonucleotides which 
correspond in sequence to at least two of the two or more nucleic acids; and, (c) extending the 
oligonucleotides with a polymerase, thereby making the polynucleotide. 
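
Step (a) above, identifying regions of identity and regions of diversity, can be sketched for the simplest case of two sequences that are already aligned to the same length. The function name `identity_regions` and the example sequences are hypothetical, and a real workflow would first align the nucleic acids with a dedicated alignment tool.

```python
from itertools import groupby


def identity_regions(a, b):
    """Split two pre-aligned, equal-length sequences into runs.

    Returns a list of (kind, start, end) tuples with end exclusive, where
    kind is "identity" if the sequences agree across the whole run and
    "diversity" otherwise.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    regions = []
    pos = 0
    for same, run in groupby(zip(a, b), key=lambda pair: pair[0] == pair[1]):
        length = sum(1 for _ in run)
        regions.append(("identity" if same else "diversity", pos, pos + length))
        pos += length
    return regions


if __name__ == "__main__":
    for region in identity_regions("ATGGCATTC", "ATGACGTTC"):
        print(region)
    # ('identity', 0, 3)
    # ('diversity', 3, 4)
    # ('identity', 4, 5)
    # ('diversity', 5, 6)
    # ('identity', 6, 9)
```
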
[00038] The invention further provides a screening assay for identifying a nitrilase, the 
assay comprising: (a) providing a plurality of nucleic acids or polypeptides comprising at least 
one of the nucleic acids of the invention, or at least one of the polypeptides of the invention; (b) 
obtaining polypeptide candidates to be tested for nitrilase activity from the plurality; (c) 
testing the candidates for nitrilase activity; and (d) identifying those polypeptide candidates 
which are nitrilases. In one aspect, the method further comprises modifying at least one of 
the nucleic acids or polypeptides prior to testing the candidates for nitrilase activity. In 
another aspect, the testing of step (c) further comprises testing for improved expression of the 
polypeptide in a host cell or host organism. In a further aspect, the testing of step (c) further 
comprises testing for nitrilase activity within a pH range from about pH 3 to about pH 12. In 
a further aspect, the testing of step (c) further comprises testing for nitrilase activity within a 
pH range from about pH 5 to about pH 10. In another aspect, the testing of step (c) further 
comprises testing for nitrilase activity within a temperature range from about 4 °C to about 80 
°C. In another aspect, the testing of step (c) further comprises testing for nitrilase activity 
within a temperature range from about 4 °C to about 55 °C. In another aspect, the testing of 
step (c) further comprises testing for nitrilase activity which results in an enantioselective 
reaction product. In another aspect, the testing of step (c) further comprises testing for nitrilase activity 
which results in a regio-selective reaction product. 

[00039] The invention provides for use of the nucleic acids of the invention, or a fragment 
or peptidomimetic thereof having nitrilase activity, in a process designed to optimize one 
aspect of the gene or one aspect of the polypeptide encoded by the gene. In one aspect, the 
process comprises introducing modifications into the nucleotide sequence of the nucleic acid. 
In another aspect, the modifications are introduced by PCR, error-prone PCR, shuffling, 
oligonucleotide-directed mutagenesis, assembly PCR, sexual PCR mutagenesis, in vivo 
mutagenesis, cassette mutagenesis, recursive ensemble mutagenesis, exponential ensemble 
mutagenesis, site-specific mutagenesis, gene reassembly, gene site saturated mutagenesis, 
ligase chain reaction, in vitro mutagenesis, oligonucleotide synthesis, 
any other DNA-generating technique or any combination thereof. In a further aspect, the 
process is repeated. 

[00040] The invention provides for use of the polypeptide of the invention, or a fragment 
or peptidomimetic thereof having nitrilase activity, in an industrial process. In one aspect, 
the process is for production of a pharmaceutical composition, the process is for production 
of a chemical, the process is for production of a food additive, the process is catalyzing the 
breakdown of waste, or the process is production of a drug intermediate. In a further aspect, 
the process comprises use of the polypeptide to hydrolyze a hydroxyglutarylnitrile substrate. 
In a further aspect, the process is for production of LIPITOR™. In another aspect, the 
polypeptide used comprises a polypeptide having consecutive amino acids of the sequence 
SEQ ID NO:44, 196, 208, 210, or 238 or a fragment thereof having nitrilase activity. In 
another aspect, the process is production of a detergent. In another aspect, the process is 
production of a food product. 

[00041] The invention provides for use of a nucleic acid of the invention, or a fragment 
thereof encoding a polypeptide having nitrilase activity, in the preparation of a transgenic 
organism. 

[00042] The invention provides for a kit comprising (a) the nucleic acid of the invention, 
or a fragment thereof encoding a polypeptide having nitrilase activity, or (b) the polypeptide 
of the invention, or a fragment or a peptidomimetic thereof having nitrilase activity, or a 
combination thereof; and (c) a buffer. 

[00043] The invention provides for a method for modifying a molecule comprising: (a) 
mixing a polypeptide of the invention or a fragment or peptidomimetic thereof having 
nitrilase activity, with a starting molecule to produce a reaction mixture; and (b) reacting the 
starting molecule with the polypeptide to produce the modified molecule. 
[00044] The invention provides for a method for identifying a modified compound 
comprising: (a) admixing a polypeptide of the invention, or a fragment or peptidomimetic 
thereof having nitrilase activity, with a starting compound to produce a reaction mixture and 
thereafter a library of modified starting compounds; (b) testing the library to determine 
whether a modified starting compound is present within the library which exhibits a desired 
activity; (c) identifying the modified compound exhibiting the desired activity. 
[00045] The invention provides a screening assay for enantioselective transformation 
comprising: (a) providing a molecule having two prochiral or enantiotopic moieties; (b) 
labeling at least one prochiral or enantiotopic moiety of the molecule; (c) modifying at least 
one of the two moieties by a selective catalyst; and (d) detecting results by mass 
spectroscopy. The screening assay can be used to determine or monitor the % enantiomeric 
excess (ee) or determine the % diastereomeric excess (de). An exemplary label useful in the 
assay is a heavier isotope or a lighter isotope. The selective catalyst useful in the assay can be 
an enzyme. The screening assay may be performed with both moieties labeled. The 
screening assay may be performed in both directions, i.e., from the reactants to the products 
as well as from the products to the reactants. 
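
As a hedged illustration of how the % ee readout of such an assay might be computed: when one enantiotopic moiety carries an isotope label, the two possible products differ in mass and appear as separate peaks. The sketch below assumes, which the text does not state, that the two peak intensities are proportional to the amounts of the two products. The function name `percent_ee` and the example intensities are hypothetical.

```python
def percent_ee(intensity_major, intensity_minor):
    """Percent enantiomeric excess from two product peak intensities.

    Assumes the intensities are proportional to the amounts of the two
    isotopically distinguishable products (an illustrative assumption).
    """
    total = intensity_major + intensity_minor
    if total <= 0:
        raise ValueError("no product signal detected")
    return 100.0 * abs(intensity_major - intensity_minor) / total


if __name__ == "__main__":
    # Hypothetical peak intensities for the two products.
    print(percent_ee(97.5, 2.5))  # 95.0
```

A racemic outcome (equal intensities) gives 0% ee, and a single detectable product gives 100% ee.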

[00046] The invention provides for a computer readable medium having stored thereon a 
nucleic acid of the invention, e.g., a nucleic acid comprising at least one nucleotide sequence 
selected from the group consisting of: SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 
27, 29, 31, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 
79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 
121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 
157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 
193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 
229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 
265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 
301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 
337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 
373, 375, 377, 379, 381, 383, 385, and variants thereof, and/or at least one amino acid 
sequence selected from the group consisting of: SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 
22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 
72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 
116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 
152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 
188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 
224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 
260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 
296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 
332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 
368, 370, 372, 374, 376, 378, 380, 382, 384, 386, and variants thereof. 

[00047] The invention provides for a computer system comprising a processor and a data 
storage device, wherein the data storage device has stored thereon a nucleic acid of the 
invention, e.g., a nucleic acid comprising at least one nucleotide sequence selected from the 
group consisting of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 35, 
37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 
87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 
127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 
163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 
199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 
235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 
271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 
307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 
343, 345, 347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 
379, 381, 383, 385, and variants thereof, and/or at least one amino acid sequence selected 
from the group consisting of: SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 
32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 
82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 
124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 
160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 
196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 
232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 
268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 
304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 
340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 
376, 378, 380, 382, 384, 386, and variants thereof. In one aspect, the computer system 
further comprises a sequence comparison algorithm and a data storage device having at least 
one reference sequence stored thereon. In another aspect, the sequence comparison algorithm 
comprises a computer program that identifies polymorphisms. 
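
A minimal sketch of the kind of comparison such a program might perform, assuming the stored reference sequence and the query are already aligned to equal length and differ only by substitutions. The function name `find_polymorphisms` and the example sequences are hypothetical; the text does not specify any particular algorithm.

```python
def find_polymorphisms(reference, query):
    """Return (position, reference_base, query_base) for every position at
    which two aligned, equal-length sequences differ. Positions are 1-based."""
    if len(reference) != len(query):
        raise ValueError("sequences must be aligned to the same length")
    return [
        (i, r, q)
        for i, (r, q) in enumerate(zip(reference, query), start=1)
        if r != q
    ]


if __name__ == "__main__":
    print(find_polymorphisms("ATGACCGTA", "ATGGCCGTT"))
    # [(4, 'A', 'G'), (9, 'A', 'T')]
```
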
[00048] The invention provides for a method for identifying a feature in a sequence which 
comprises: (a) inputting the sequence into a computer; (b) running a sequence feature 
identification program on the computer so as to identify a feature within the sequence; and (c) 
identifying the feature in the sequence, wherein the sequence comprises a nucleic acid of the 
invention, e.g., a nucleic acid comprising at least one of SEQ ID NOS: 1-386, its variants, or 
any combination thereof. 

[00049] The invention provides for an assay for identifying a functional fragment of a 
polypeptide which comprises: (a) obtaining a fragment of at least one polypeptide of the 
invention; (b) contacting at least one fragment from step (a) with a substrate having a 
cyanohydrin moiety or an aminonitrile moiety under reaction conditions suitable for nitrilase 
activity; (c) measuring the amount of reaction product produced by each at least one fragment 
from step (b); and (d) identifying the at least one fragment which is capable of producing a 
nitrilase reaction product; thereby identifying a functional fragment of the polypeptide. In 
one aspect, the fragment of step (a) is obtained by synthesizing the fragment. In another 
aspect, the fragment of step (a) is obtained by fragmenting the polypeptides. 

[00050] The invention provides for an assay for identifying a functional variant of a 
polypeptide which comprises: (a) obtaining at least one variant of at least one polypeptide of 
the invention; (b) contacting at least one variant from step (a) with a substrate having a 
cyanohydrin moiety or an aminonitrile moiety under reaction conditions suitable for nitrilase 
activity; (c) measuring the amount of reaction product produced by each at least one variant 
from step (b); and (d) identifying the at least one variant which is capable of producing a 
nitrilase reaction product; thereby identifying a functional variant of the polypeptide. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[00051] Figure 1 shows chemical reaction schemes wherein stereoselective nitrilases 
hydrolyze a cyanohydrin or an aminonitrile to produce a chiral α-hydroxy acid or α-amino 
acid. 

[00052] Figure 2 illustrates an OPA based cyanide detection assay used for identifying the 
presence of nitrilase activity. 

[00053] Figure 3 is an illustration of a spectroscopic system for the detection and 
quantification of α-hydroxy acids based on stereoselective lactate dehydrogenases. 

[00054] Figure 4 is an illustration of a spectroscopic system for the detection and 
quantification of α-amino acids based on stereoselective amino acid oxidase. 

[00055] Figure 5 is a flow diagram illustrating the steps of a nitrilase screening method. 

[00056] Figures 6A-6E are chromatograms characteristic of the substrate and product 
combination for D-phenylglycine showing a blank sample (Fig. 6A); an enzymatic reaction 
sample (Fig. 6B); a negative control consisting of cell lysate in buffer (Fig. 6C); a chiral 
analysis of phenylglycine (Fig. 6D); and coelution of the nitrile peak with the D-enantiomer 
(Fig. 6E). 

[00057] Figures 7A-7E illustrate chromatograms which are characteristic of substrate and 
product combinations for (R)-2-chloromandelic acid. Fig. 7A shows only 
2-chloromandelonitrile in buffer; Fig. 7B shows a chloromandelic acid standard. The 
chromatogram in Fig. 7C shows the appearance of product and the reduction of substrate 
peaks. 

[00058] Figures 8A-8B illustrate chromatograms characteristic of substrate and product 
combinations for (S)-phenyllactic acid. 

[00059] Figures 9A-9B illustrate chromatograms characteristic of substrate and product 
combinations for L-2-methylphenylglycine. 
[00060] Figures 10A-10C illustrate chromatograms characteristic of substrate and product 
combinations for L-tert-leucine. 

[00061] Figures 11A-11C illustrate chromatograms characteristic of substrate and product 
combinations for (S)-2-amino-6-hydroxy hexanoic acid. 

[00062] Figures 12A-12D illustrate chromatograms characteristic of substrate and product 
combinations for 4-methyl-D-leucine and 4-methyl-L-leucine. 

[00063] Figures 13A-13B illustrate chromatograms characteristic of substrate and product 
combinations for (S)-cyclohexylmandelic acid. 

[00064] Figures 14A-14B illustrate two exemplary standard curves for quantitation in 
connection with the screening assay of the invention. 
[00065] Figure 15 illustrates selected compounds that can be produced from a 
nitrilase-catalyzed reaction using an enzyme and/or a method of the invention. 

[00066] Figure 16 illustrates selected compounds that can be produced from a 
nitrilase-catalyzed reaction using an enzyme and/or a method of the invention. 

[00067] Figure 17 illustrates an exemplary nitrilase reaction of the invention. 

DETAILED DESCRIPTION OF THE INVENTION 

[00068] The present invention relates to nitrilases, nucleic acids encoding nitrilases, and 
uses therefor. As used herein, the term "nitrilase" encompasses any polypeptide having any 
nitrilase activity, for example, the ability to hydrolyze nitriles into their corresponding 
carboxylic acids and ammonia. Nitrilases have commercial utility as biocatalysts for use in 
the synthesis of enantioselective aromatic and aliphatic amino acids or hydroxy acids. 



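[00069] Nitrilase chemistry is as follows: 

[Reaction scheme: a nitrile bearing substituents R1, R2 and R3 on the carbon adjacent to the -CN group is converted by a nitrilase to the corresponding carboxylic acid (-COOH).] 

[00070] A nitrilase reaction for the preparation of hydroxy acids is as follows: 

[Reaction scheme: a cyanohydrin (a nitrile bearing -OH on the carbon adjacent to the -CN group) is converted by a nitrilase to one or the other enantiomer of the corresponding hydroxy acid.] 

[00071] A nitrilase reaction for the preparation of amino acids is as follows: 

[Reaction scheme: an aminonitrile (a nitrile bearing -NH2 on the carbon adjacent to the -CN group) is converted by a nitrilase to the corresponding amino acid.] 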




[00072] In addition, in each of the foregoing hydrolysis reactions, two water molecules are 
consumed and one ammonia molecule is released. 
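
The stoichiometry stated above (two water molecules consumed, one ammonia released) can be checked with a simple atom balance. The sketch below uses mandelonitrile and mandelic acid, a substrate and product pair named elsewhere in this document, as the worked example; the helper name `total_atoms` is hypothetical.

```python
from collections import Counter


def total_atoms(species):
    """Sum atom counts over (coefficient, formula) pairs, where each formula
    is a dict mapping element symbol to count."""
    total = Counter()
    for coefficient, formula in species:
        for element, count in formula.items():
            total[element] += coefficient * count
    return total


# Molecular formulas: mandelonitrile C8H7NO, water H2O,
# mandelic acid C8H8O3, ammonia NH3.
mandelonitrile = {"C": 8, "H": 7, "N": 1, "O": 1}
water = {"H": 2, "O": 1}
mandelic_acid = {"C": 8, "H": 8, "O": 3}
ammonia = {"N": 1, "H": 3}

# R-CN + 2 H2O -> R-COOH + NH3
reactants = total_atoms([(1, mandelonitrile), (2, water)])
products = total_atoms([(1, mandelic_acid), (1, ammonia)])

if __name__ == "__main__":
    print(reactants == products)  # True
```
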

[00073] There are several different types of assays which can be performed to test for the 
presence of nitrilase activity in a sample or to test whether a particular polypeptide exhibits 
nitrilase activity. For example, assays can detect the presence or absence of products or by- 
products from a chemical reaction catalyzed by a nitrilase. For example, the presence of 
nitrilase activity can be detected by the production of α-hydroxy acids or α-amino acids 
from, respectively, cyanohydrins or aminonitriles, and the level of nitrilase activity can be 
quantified by measuring the relative quantities of the reaction products produced. Figure 1 
shows chemical reaction schemes using stereoselective nitrilases to create chiral α-hydroxy 
acids or α-amino acids in high yield. The starting material is an aldehyde or an imine which 
is produced from an aldehyde by reaction with ammonia. Reaction of the aldehyde or imine 
with hydrogen cyanide results in the production of enantiomeric mixtures of the 
corresponding cyanohydrins and aminonitriles. A stereoselective nitrilase can then be used to 
stereoselectively convert one enantiomer into the corresponding α-hydroxy acid or α-amino 
acid. Figure 3 illustrates schematically the stereoselective nitrilase-dependent production and 
spectrophotometric detection of α-hydroxy acids based on lactate dehydrogenase conversion 
of the α-hydroxy acids to the corresponding α-keto acids and concomitant 
oxidation-reduction of a detectable dye. Figure 4 illustrates schematically the stereoselective 
nitrilase-dependent production and spectrophotometric detection of α-amino acids based on amino 
acid oxidase conversion of the α-amino acids to the corresponding α-keto acids and 
concomitant oxidation-reduction of a detectable dye. 

[00074] Nitrilases contemplated for use in the practice of the present invention include 
those which stereoselectively hydrolyze nitriles or cyanohydrins into their corresponding 
acids and ammonia. In one aspect, nitrilases of the invention can stereoselectively hydrolyze 
nitriles or cyanohydrins into their corresponding acids and ammonia. Nitrilases include, for 
example, nitrilases of the invention, e.g., those set forth in the Group B amino acid 
sequences. Some nitrilases which stereoselectively hydrolyze their substrates are set forth in 
the Tables hereinbelow. 

[00075] The nitrilases of the invention share the following additional characteristics: 

(1) full-length amino acid sequences from about 333 amino acids to about 366 amino acids, 
(2) aggregation and activity as homo-multimers of about 2 subunits to about 16 subunits, 
(3) presence of a catalytic triad of the consecutive amino acids Glu-Lys-Cys, 
(4) pH optima from about pH 5 to about pH 9, and 
(5) temperature optima from about 0°C to about 100°C, or from about 40°C to about 50°C. 

Consensus Sequences Among New Nitrilases 
[00076] The nitrilases disclosed herein were studied using bioinformatics and sequence 
comparison programs and the following consensus information was collected. Three regions 
of conserved motifs were identified within the nitrilase polypeptides. These correspond to 
the catalytic triad (E-K-C) present in nitrilase enzymes. (H. Pace and C. Brenner (Jan. 15, 
2001) "The Nitrilase Superfamily: classification, structure and function" Genome Biology 
Vol. 2, No. 1, pp. 1-9.) 

[00077] The abbreviations used herein are conventional one letter codes for the amino 
acids: A, alanine; B, asparagine or aspartic acid; C, cysteine; D, aspartic acid; E, glutamate, 
glutamic acid; F, phenylalanine; G, glycine; H, histidine; I, isoleucine; K, lysine; L, leucine; 
M, methionine; N, asparagine; P, proline; Q, glutamine; R, arginine; S, serine; T, threonine; 
V, valine; W, tryptophan; Y, tyrosine; Z, glutamine or glutamic acid. See L. Stryer, 
Biochemistry, 1988, W. H. Freeman and Company, New York. 

[00078] The computer sequence comparisons made among the nitrilase polypeptide 
sequences of the invention resulted in the identification of these motifs within each amino 
acid sequence: 

[00079] The following residues (those that are underlined) are completely conserved 
among all of the identified nitrilases: the third amino acid in the first motif or region (E, 
glutamate); the second residue in the second motif (R, arginine); the third residue in the 
second motif (K, lysine); the third residue in the third motif (C, cysteine); and the fifth 
residue in the third motif (E, glutamate). 

[00080] In the boxes, upper case letters indicate 90% or greater consensus among the 
nitrilases of the invention, while lower case letters indicate 50% or greater consensus. An 
italicized letter indicates 30% or greater consensus among the nitrilases of the invention. A 
dot in a box indicates a residue which is not conserved. 
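
The notation described above can be expressed as a small rule applied to each column of a set of aligned motif sequences. In the sketch below the ten five-residue sequences are invented purely for illustration and are not taken from the sequences of the invention. The names `consensus_symbol` and `consensus` are hypothetical, and an italicized letter is represented in plain text by surrounding underscores.

```python
from collections import Counter


def consensus_symbol(column):
    """Map one alignment column to the notation described above."""
    residue, count = Counter(column).most_common(1)[0]
    fraction = count / len(column)
    if fraction >= 0.9:
        return residue.upper()          # 90% or greater consensus
    if fraction >= 0.5:
        return residue.lower()          # 50% or greater consensus
    if fraction >= 0.3:
        return "_" + residue.lower() + "_"  # stands in for an italic letter
    return "."                          # not conserved


def consensus(aligned):
    """Return the consensus notation for each column of equal-length sequences."""
    return [consensus_symbol("".join(col)) for col in zip(*aligned)]


# Invented example alignment (illustrative only).
EXAMPLE = [
    "FPETA", "FPETC", "FPETD", "FPETG", "FPESH",
    "FPESK", "FAEAL", "FAEAM", "FAEGN", "FAEGQ",
]

if __name__ == "__main__":
    print(consensus(EXAMPLE))  # ['F', 'p', 'E', '_t_', '.']
```
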

[00081] The sequences of nitrilases in the nitrilase branch of the nitrilase superfamily 
were described as having a catalytic triad in the Pace and Brenner article (Genome Biology, 
2001, Vol. 2, No. 1, pp. 1-9). However, the catalytic triad regions of the nitrilases of this 
invention differ from those previously identified in the Pace and Brenner reference in the 
following ways: 

[00082] Differences in the first motif: The F in the first box of the first motif is 
conserved in 90% of the nitrilases of the invention, rather than in only 50% of those 
previously identified. The fourth residue of the first motif is a "t" (threonine) in the nitrilases 
of this invention, and it is found at 50% or greater consensus. However, that residue was 
identified by Pace and Brenner as "a" (alanine). The last residue of the first motif was 
identified as "f" (phenylalanine) and was indicated to occur at 50% or greater consensus. 
However, the nitrilases of this invention only show "f" (phenylalanine) occurring at 30% 
consensus. 

[00083] Differences in the second motif: There is an "r" (arginine) in the first box of 
the second motif of the nitrilases of this invention. However, the Pace and Brenner 
consensus shows an "h" (histidine) in that position. The "R" (arginine) in the second box is 
completely conserved in the nitrilases of the present invention; however, that residue only 
appears at 90% consensus in the Pace and Brenner reference. The "L" (leucine) in the fourth 
box of the second motif is conserved in 90% or more of the nitrilases of this invention. 
However, the Pace and Brenner nitrilases only showed conservation of that residue in 50% of 
the sequences. Similarly, the "P" (proline) at the sixth box of the second motif is conserved 
in 90% or more of the nitrilases of this invention. However, the Pace and Brenner nitrilases 
only showed conservation of that residue in 50% of the sequences. 

[00084] Differences in the third motif: The "L" (leucine) in the first box is conserved at 90% or 
greater in the nitrilases of the invention. However, the Pace and Brenner reference only 
shows that residue appearing 50% of the time. Finally, the sixth box in the third motif in the 
nitrilases of the invention shows a histidine 50% of the time or more. However, the Pace and 
Brenner reference indicates that that position shows an asparagine ("n") 50% of the time. 

[00085] The invention provides for an isolated polypeptide having nitrilase activity which 
polypeptide comprises three regions, wherein the first region comprises five amino acids and 
wherein the first amino acid of the first region is F and the fourth amino acid of the first 
region is T. The invention also provides for an isolated polypeptide having nitrilase activity 
which polypeptide comprises three regions, wherein the second region comprises seven 
amino acids and wherein the first amino acid of the second region is R, wherein the second 
amino acid of the second region is R, and wherein the sixth amino acid of the second region 
is P. The invention also provides for an isolated polypeptide having nitrilase activity which 
polypeptide comprises three regions, wherein the third region comprises nine amino acids 
and wherein the first amino acid of the third region is L and the sixth amino acid of the third 
region is H. 

[00086] The invention also provides for an isolated polypeptide having nitrilase activity 
which polypeptide comprises three consensus subsequences, wherein the first consensus 
subsequence is FPETF, wherein the second consensus subsequence is RRKLXPT, and 
wherein the third consensus subsequence is LXCWEHXXP. 

[00087] The invention also provides for an isolated polypeptide having nitrilase activity 
which polypeptide comprises three consensus subsequences, wherein the first consensus 
subsequence is FPEXX, wherein the second consensus subsequence is XRKLXPT, and 
wherein the third consensus subsequence is LXCWEXXXP. 
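
A candidate amino acid sequence could be checked against these consensus subsequences with regular expressions, as in the sketch below. Two assumptions are made that the text does not state: X is read as "any amino acid", and the three subsequences are required to occur in the order first, second, third without overlapping. The function names and the test sequences are hypothetical and are not sequences of the invention.

```python
import re

# Broader consensus subsequences from the paragraph above.
MOTIFS = ("FPEXX", "XRKLXPT", "LXCWEXXXP")


def to_regex(consensus):
    """Compile a consensus subsequence, treating X as any upper-case letter."""
    return re.compile("".join("[A-Z]" if c == "X" else c for c in consensus))


def has_all_motifs(protein, motifs=MOTIFS):
    """True if every motif occurs in the given order, each after the last."""
    pos = 0
    for motif in motifs:
        match = to_regex(motif).search(protein, pos)
        if match is None:
            return False
        pos = match.end()
    return True


if __name__ == "__main__":
    # Invented test sequences (illustrative only).
    print(has_all_motifs("MAAFPETFGGGRRKLMPTAAALGCWEHLQPKK"))  # True
    print(has_all_motifs("MAAFPETFGGGRRKLMPTAAA"))             # False
```

Passing the narrower subsequences (FPETF, RRKLXPT, LXCWEHXXP) as the `motifs` argument applies the stricter definition in the same way.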

[00088] In accordance with the present invention, methods are provided for producing 
enantiomerically pure α-substituted carboxylic acids. The enantiomerically pure 
α-substituted carboxylic acids produced by the methods of the present invention have the 
following structure: 

[Structure: an α-substituted carboxylic acid whose α-carbon bears the groups R1, R2 and E.] 

wherein: 

R1 ≠ R2, and R1 and R2 are otherwise independently -H, substituted or unsubstituted 
alkyl, alkenyl, alkynyl, aryl, heteroaryl, cycloalkyl, or heterocyclic, wherein said substituents 
are lower alkyl, hydroxy, alkoxy, amino, mercapto, cycloalkyl, heterocyclic, aryl, heteroaryl, 
aryloxy, or halogen or optionally R1 and R2 are directly or indirectly covalently joined to form 
a functional cyclic moiety, and E is -N(Rx)2 or -OH, wherein each Rx is independently -H or 
lower alkyl. 

[00089] As used herein, the term "alkyl" refers to straight or branched chain or cyclic 
hydrocarbon groups of from 1 to 24 carbon atoms, including methyl, ethyl, n-propyl, 
isopropyl, n-butyl, isobutyl, tert-butyl, n-pentyl, n-hexyl, and the like. The term "lower 
alkyl" refers to monovalent straight or branched chain or cyclic radicals of from one to about 
six carbon atoms. 

[00090] As used herein, "alkenyl" refers to straight or branched chain or cyclic 
hydrocarbon groups having one or more carbon-carbon double bonds, and having in the range 
of about 2 to about 24 carbon atoms. 

[00091] As used herein, "alkynyl" refers to straight or branched chain or cyclic 
hydrocarbon groups having at least one carbon-carbon triple bond, and having in the range of 
about 2 to about 24 carbon atoms. 

[00092] As used herein, "cycloalkyl" refers to cyclic hydrocarbon groups containing in the 
range of about 3 to about 14 carbon atoms. 




[00093] As used herein, "heterocyclic" refers to cyclic groups containing one or more 
heteroatoms (e.g., N, O, S, P, Se, B, etc.) as part of the ring structure, and having in the range 
of about 3 to about 14 carbon atoms. 

[00094] As used herein, "aryl" refers to aromatic groups (i.e., cyclic groups with 
conjugated double-bond systems) having in the range of about 6 to about 14 carbon atoms. 
[00095] As used herein with respect to a chemical group or moiety, the term "substituted" 
refers to such a group or moiety further bearing one or more non-hydrogen substituents. 
Examples of such substituents include, without limitation, oxy (e.g., in a ketone, aldehyde, 
ether, or ester), hydroxy, alkoxy (of a lower alkyl group), amino, thio, mercapto (of a lower 
10 alkyl group), cycloalkyl, substituted cycloalkyl, heterocyclic, substituted heterocyclic, aryl, 
substituted aryl, heteroaryl, substituted heteroaryl, aryloxy, substituted aryloxy, halogen, 
trifluoromethyl, cyano, nitro, nitrone, amino, amido, -C(0)H, acyl, oxyacyl, carboxyl, 
carbamate, sulfonyl, sulfonamide, sulfuryl, and the like. 

[00096] In preferred aspects, the enantiomerically pure α-substituted carboxylic acid
produced by the methods of the present invention is an α-amino acid or α-hydroxy acid. In
some aspects, the enantiomerically pure α-amino acid is D-phenylalanine, D-phenylglycine,
L-methylphenylglycine, L-tert-leucine, D-alanine, or D-hydroxynorleucine
((S)-2-amino-6-hydroxy hexanoic acid), R-pantolactone, 2-chloromandelic acid, or (S)- or
(R)-mandelic acid, and the enantiomerically pure α-hydroxy acid is (S)-cyclohexylmandelic
acid. As used herein, a "small molecule" encompasses any molecule having a molecular
weight of at least 25 Daltons.

[00097] The term "about" is used herein to mean approximately, roughly, around, or in the
region of. When the term "about" is used in conjunction with a numerical range, it modifies
that range by extending the boundaries above and below the numerical values set forth. In
general, the term "about" is used herein to modify a numerical value above and below the
stated value by a variance of 20 percent up or down (higher or lower).

[00098] As used herein, the word "or" means any one member of a particular list and also
includes any combination of members of that list.
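
As a minimal, non-limiting sketch, the 20 percent variance that paragraph [00097] attaches to the term "about" can be worked through in Python. The function names `about_value` and `about_range` are hypothetical and serve only this illustration; the sketch assumes the variance is applied to each stated boundary separately.

```python
def about_value(value, variance=0.20):
    """Return the (low, high) interval covered by 'about value'.

    Assumption: the 20 percent variance is taken relative to the stated value.
    """
    delta = abs(value) * variance
    return (value - delta, value + delta)


def about_range(low, high, variance=0.20):
    """Extend a stated numerical range below its lower and above its upper bound."""
    extended_low = about_value(low, variance)[0]
    extended_high = about_value(high, variance)[1]
    return (extended_low, extended_high)


if __name__ == "__main__":
    # "about 2 to about 24 carbon atoms", read with the 20 percent variance
    print(about_range(2, 24))
```

Under this reading, "about 10" spans roughly 8 to 12, and a range of "about 2 to about 24" spans roughly 1.6 to 28.8.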

[00099] The phrase "nucleic acid" as used herein refers to a naturally occurring or
synthetic oligonucleotide or polynucleotide, whether DNA or RNA or DNA-RNA hybrid,
single-stranded or double-stranded, sense or antisense, which is capable of hybridization to a
complementary nucleic acid by Watson-Crick base-pairing. Nucleic acids of the invention
can also include nucleotide analogs (e.g., BrdU), and non-phosphodiester internucleoside
linkages (e.g., peptide nucleic acid (PNA) or thiodiester linkages). In particular, nucleic acids
can include, without limitation, DNA, RNA, cDNA, gDNA, ssDNA or dsDNA or any
combination thereof. In some aspects, a "nucleic acid" of the invention includes, for
example, a nucleic acid encoding a polypeptide as set forth in the Group B amino acid
sequences, and variants thereof. The phrase "a nucleic acid sequence" as used herein refers to
a consecutive list of abbreviations, letters, characters or words, which represent nucleotides.
In one aspect, a nucleic acid can be a "probe" which is a relatively short nucleic acid, usually
less than 100 nucleotides in length. Often a nucleic acid probe is from about 10 nucleotides
in length to about 50 nucleotides in length. A "target region" of a nucleic acid is a portion of
a nucleic acid that is identified to be of interest.

[000100] A "coding region" of a nucleic acid is the portion of the nucleic acid which is
transcribed and translated in a sequence-specific manner to produce a particular
polypeptide or protein when placed under the control of appropriate regulatory sequences.
The coding region is said to encode such a polypeptide or protein.

[000101] The term "gene" refers to a coding region operably joined to appropriate
regulatory sequences capable of regulating the expression of the polypeptide in some manner.
A gene includes untranslated regulatory regions of DNA (e.g., promoters, enhancers,
repressors, etc.) preceding (upstream) and following (downstream) the coding region (open
reading frame, ORF) as well as, where applicable, intervening sequences (i.e., introns)
between individual coding regions (i.e., exons).

[000102] "Polypeptide" as used herein refers to any peptide, oligopeptide, polypeptide, 
gene product, expression product, or protein. A polypeptide is comprised of consecutive 
amino acids. The term "polypeptide" encompasses naturally occurring or synthetic 
molecules. 

[000103] In addition, as used herein, the term "polypeptide" refers to amino acids joined to
each other by peptide bonds or modified peptide bonds, e.g., peptide isosteres, and may
contain modified amino acids other than the 20 gene-encoded amino acids. The polypeptides
can be modified by either natural processes, such as post-translational processing, or by
chemical modification techniques which are well known in the art. Modifications can occur
anywhere in the polypeptide, including the peptide backbone, the amino acid side-chains and
the amino or carboxyl termini. It will be appreciated that the same type of modification can
be present in the same or varying degrees at several sites in a given polypeptide. Also a
given polypeptide can have many types of modifications. Modifications include, without
limitation, acetylation, acylation, ADP-ribosylation, amidation, covalent cross-linking or
cyclization, covalent attachment of flavin, covalent attachment of a heme moiety, covalent
attachment of a nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid
derivative, covalent attachment of a phosphatidylinositol, disulfide bond formation,
demethylation, formation of cysteine or pyroglutamate, formylation, gamma-carboxylation,
glycosylation, GPI anchor formation, hydroxylation, iodination, methylation, myristoylation,
oxidation, pegylation, proteolytic processing, phosphorylation, prenylation, racemization,
selenoylation, sulfation, and transfer-RNA mediated addition of amino acids to protein such
as arginylation. (See Proteins: Structure and Molecular Properties, 2nd Ed., T.E. Creighton,
W.H. Freeman and Company, New York (1993); Posttranslational Covalent Modification of
Proteins, B.C. Johnson, Ed., Academic Press, New York, pp. 1-12 (1983)).

[000104] As used herein, the term "amino acid sequence" refers to a list of abbreviations,
letters, characters or words representing amino acid residues.

[000105] As used herein, the term "isolated" means that a material has been removed from
its original environment. For example, a naturally-occurring polynucleotide or polypeptide
present in a living animal is not isolated, but the same polynucleotide or polypeptide,
separated from some or all of the coexisting materials in the natural system, is isolated. Such
polynucleotides can be part of a vector and/or such polynucleotides or polypeptides could be
part of a composition, and would be isolated in that such a vector or composition is not part
of its original environment.

[000106] As used herein with respect to nucleic acids, the term "recombinant" means that
the nucleic acid is covalently joined and adjacent to a nucleic acid to which it is not adjacent
in its natural environment. Additionally, as used herein with respect to a particular nucleic
acid in a population of nucleic acids, the term "enriched" means that the nucleic acid
represents 5% or more of the number of nucleic acids in the population of molecules.
Typically, the enriched nucleic acids represent 15% or more of the number of nucleic acids in
the population of molecules. More typically, the enriched nucleic acids represent 50%, 90%
or more of the number of nucleic acids in the population of molecules.

[000107] "Recombinant" polypeptides or proteins refer to polypeptides or proteins
produced by recombinant DNA techniques, i.e., produced from cells transformed by an
exogenous recombinant DNA construct encoding the desired polypeptide or protein.
"Synthetic" polypeptides or proteins are those prepared by chemical synthesis (e.g.,
solid-phase peptide synthesis). Chemical peptide synthesis is well known in the art (see, e.g.,
Merrifield (1963), J. Am. Chem. Soc. 85:2149-2154; Geysen et al. (1984), Proc. Natl. Acad.
Sci. USA 81:3998) and synthesis kits and automated peptide synthesizers are commercially
available (e.g., Cambridge Research Biochemicals, Cleveland, United Kingdom; Model 431A
synthesizer from Applied Biosystems, Inc., Foster City, CA). Such equipment provides ready
access to the peptides of the invention, either by direct synthesis or by synthesis of a series of
fragments that can be coupled using other known techniques.

[000108] As used herein with respect to pairs of nucleic acid or amino acid sequences,
"identity" refers to the extent to which the two sequences are invariant at positions within the
sequence which can be aligned. The percent identity between two given sequences can be
calculated using an algorithm such as BLAST (Altschul et al. (1990), J. Mol. Biol.
215:403-410). See www.ncbi.nlm.nih.gov/Education/BLASTinfo. When using the BLAST algorithm
for sequences no longer than 250 nucleotides or about 80 amino acids ("short queries"), the
search parameters can be as follows: the filter is off, the scoring matrix is PAM30, the word
size is 3 or 2, the E value is 1000 or more, and the gap costs are 11, 1. For sequences longer
than 250 nucleotides or 80 amino acid residues, the default search parameters can be used.
The BLAST website provides advice for special circumstances which is to be followed in
such circumstances.
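
For illustration only, the notion of percent identity over alignable positions can be sketched as follows. This is not the BLAST algorithm: it assumes the two sequences have already been aligned (for example by BLAST) and are supplied as equal-length strings with "-" marking gaps. The function name `percent_identity` and the gap character are assumptions made for this sketch.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of aligned (non-gap) positions at which two sequences are invariant.

    Assumption: inputs are pre-aligned, equal-length strings; positions where
    either sequence carries a gap are treated as not alignable and are skipped.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for residue_a, residue_b in zip(aligned_a, aligned_b):
        if residue_a == gap or residue_b == gap:
            continue  # position cannot be aligned residue-to-residue
        compared += 1
        if residue_a.upper() == residue_b.upper():
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


if __name__ == "__main__":
    print(percent_identity("ACGT-A", "ACCTGA"))
```

In the usage line, five positions can be compared and four of them match, giving 80 percent identity over the aligned positions.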

[000109] As used herein, "homology" has the same meaning as "identity" in the context of
nucleotide sequences. However, with respect to amino acid sequences, "homology" includes
the percentage of identical and conservative amino acid substitutions. Percentages of
homology can be calculated according to the algorithms of Smith and Waterman (1981), Adv.
Appl. Math. 2:482.

[000110] As used herein in the context of two or more nucleic acid sequences, two
sequences are "substantially identical" when they have at least 99.5% nucleotide identity,
when compared and aligned for maximum correspondence, as measured using the known
sequence comparison algorithms described above. In addition, for purposes of determining
whether sequences are substantially identical, synonymous codons in a coding region may be
treated as identical to account for the degeneracy of the genetic code. Typically, the region
for determination of substantial identity must span at least about 20 residues, and most
commonly the sequences are substantially identical over at least about 25-200 residues.

[000111] As used herein in the context of two or more amino acid sequences, two
sequences are "substantially identical" when they have at least 99.5% identity, when
compared and aligned for maximum correspondence, as measured using the known sequence
comparison algorithms described above. In addition, for purposes of determining whether
sequences are substantially identical, conservative amino acid substitutions may be treated as
identical if the polypeptide substantially retains its biological function.
[000112] "Hybridization" refers to the process by which a nucleic acid strand joins with a 
complementary strand through hydrogen bonding at complementary bases. Hybridization 
assays can be sensitive and selective so that a particular sequence of interest can be identified 
even in samples in which it is present at low concentrations. Stringent conditions are defined 
by concentrations of salt or formamide in the prehybridization and hybridization solutions, or 
by the hybridization temperature, and are well known in the art. Stringency can be increased 
by reducing the concentration of salt, increasing the concentration of formamide, or raising 
the hybridization temperature. In particular, as used herein, "stringent hybridization 
conditions" include 42°C in 50% formamide, 5X SSPE, 0.3% SDS, and 200 ng/ml sheared 
and denatured salmon sperm DNA, and equivalents thereof. Variations on the above ranges 
and conditions are well known in the art. 

[000113] The term "variant" refers to polynucleotides or polypeptides of the invention
modified at one or more nucleotides or amino acid residues (respectively) and wherein the
encoded polypeptide or polypeptide retains nitrilase activity. Variants can be produced by
any number of means including, for example, error-prone PCR, shuffling,
oligonucleotide-directed mutagenesis, assembly PCR, sexual PCR mutagenesis, in vivo
mutagenesis, cassette mutagenesis, recursive ensemble mutagenesis, exponential ensemble
mutagenesis, site-specific mutagenesis, gene reassembly, gene site-saturated mutagenesis or
any combination thereof.

[000114] Methods of making peptidomimetics based upon a known sequence are described,
for example, in U.S. Patent Nos. 5,631,280; 5,612,895; and 5,579,250. Use of
peptidomimetics can involve the incorporation of a non-amino acid residue with non-amide
linkages at a given position. One aspect of the present invention is a peptidomimetic wherein
the compound has a bond, a peptide backbone or an amino acid component replaced with a
suitable mimic. Examples of unnatural amino acids which may be suitable amino acid
mimics include β-alanine, L-α-amino butyric acid, L-γ-amino butyric acid, L-α-amino
isobutyric acid, L-ε-amino caproic acid, 7-amino heptanoic acid, L-aspartic acid, L-glutamic
acid, N-ε-Boc-N-α-CBZ-L-lysine, N-ε-Boc-N-α-Fmoc-L-lysine, L-methionine sulfone,
L-norleucine, L-norvaline, N-α-Boc-N-δ-CBZ-L-ornithine, N-δ-Boc-N-α-CBZ-L-ornithine,
Boc-p-nitro-L-phenylalanine, Boc-hydroxyproline, Boc-L-thioproline.

[000115] As used herein, "small molecule" encompasses a molecule having a molecular
weight from about 20 daltons to about 1.5 kilodaltons.

[000116] The molecular biological techniques, such as subcloning, were performed
using routine methods which would be well known to one of skill in the art. (Sambrook, J.,
Fritsch, E.F., Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual (2nd ed.), Cold
Spring Harbor Laboratory Press, Plainview, NY).

Computer Systems 

[000117] In one aspect of the invention, any nucleic acid sequence and/or polypeptide
sequence of the invention can be stored, recorded, and manipulated on any medium which
can be read and accessed by a computer. As used herein, the words "recorded" and "stored"
refer to a process for storing information on a computer medium. Another aspect of the
invention is a computer readable medium having recorded thereon at least 2, 5, 10, 15 or 20
nucleic acid sequences as set forth in SEQ ID NOS: 1-386, and sequences substantially
identical thereto. A further aspect is the comparison among and between
nucleic acid sequences or polypeptide sequences of the invention and the comparison
between sequences of the invention and other sequences by a computer. Computer readable
media include magnetically readable media, optically readable media, electronically readable
media and magnetic/optical media. For example, the computer readable media may be a hard
disk, a floppy disk, a magnetic tape, CD-ROM, Digital Versatile Disk (DVD), Random
Access Memory (RAM), or Read Only Memory (ROM) as well as other types of media
known to those skilled in the art.

[000118] Aspects of the invention include systems (e.g., internet based systems),
particularly computer systems which store and manipulate the sequence information
described herein. As used herein, "a computer system" refers to the hardware components,
software components, and data storage components used to analyze a sequence (either
nucleic acid or polypeptide) as set forth in at least any one of SEQ ID NOS: 1-386 and
sequences substantially identical thereto. The computer system typically includes a processor
for processing, accessing and manipulating the sequence data. The processor can be any
well-known type of central processing unit, such as, for example, the Pentium III from Intel
Corporation, or similar processor from Sun, Motorola, Compaq, AMD or International
Business Machines.

[000119] Typically the computer system is a general purpose system that comprises the
processor and one or more internal data storage components for storing data, and one or more
data retrieving devices for retrieving the data stored on the data storage components.

[000120] In one particular aspect, the computer system includes a processor connected to a
bus which is connected to a main memory (preferably implemented as RAM) and one or
more internal data storage devices, such as a hard drive and/or other computer readable media
having data recorded thereon. In some aspects, the computer system further includes one or
more data retrieving devices for reading the data stored on the internal data storage devices.

[000121] The data retrieving device may represent, for example, a floppy disk drive, a
compact disk drive, a magnetic tape drive, or a modem capable of connection to a remote
data storage system (e.g., via the internet) etc. In some aspects, the internal data storage
device is a removable computer readable medium such as a floppy disk, a compact disk, a
magnetic tape, etc. containing control logic and/or data recorded thereon. The computer
system may advantageously include or be programmed by appropriate software for reading
the control logic and/or the data from the data storage component once inserted in the data
retrieving device.

[000122] The computer system includes a display which is used to display output to a
computer user. It should also be noted that the computer system can be linked to other
computer systems in a network or wide area network to provide centralized access to the
computer system. In some aspects, the computer system may further comprise a sequence
comparison algorithm. A "sequence comparison algorithm" refers to one or more programs
which are implemented (locally or remotely) on the computer system to compare a nucleotide
sequence with other nucleotide sequences and/or compounds stored within a data storage
means.

Uses of Nitrilases

[000123] Nitrilases have been identified as key enzymes for the production of chiral
α-hydroxy acids, which are valuable intermediates in the fine chemicals industry, and as
pharmaceutical intermediates. The nitrilase enzymes of the invention are useful to catalyze
the stereoselective hydrolysis of cyanohydrins and aminonitriles, producing chiral
α-hydroxy- and α-amino acids, respectively.

[000124] Stereoselective enzymes provide a key advantage over chemical resolution
methods as they do not require harsh conditions and are more environmentally compatible.
The use of nitrilases is of particular interest for the production of chiral amino acids and
α-hydroxy acids. Using a stereoselective nitrilase, dynamic resolution conditions can be
established, due to the racemisation of the substrate under aqueous conditions. Thus 100%
theoretical yields are achievable.
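
The yield argument above can be made concrete with a short, non-limiting sketch. It assumes the conventional textbook definitions: a classical resolution of a racemate can deliver at most 50% of a single enantiomer, whereas a dynamic resolution, in which the substrate racemizes, can in theory deliver 100%; and enantiomeric excess (ee) is the difference between the amounts of the two enantiomers divided by their sum. The function names are hypothetical.

```python
def max_theoretical_yield(substrate_racemizes):
    """Upper bound (percent) on the yield of one enantiomer from a racemic substrate.

    Assumption: without in situ racemization only the reactive half of the
    racemate can be converted; with racemization all of it can be.
    """
    return 100.0 if substrate_racemizes else 50.0


def enantiomeric_excess(major, minor):
    """Enantiomeric excess in percent, from the amounts of the two enantiomers."""
    total = major + minor
    if total <= 0:
        raise ValueError("total amount of product must be positive")
    return 100.0 * abs(major - minor) / total


if __name__ == "__main__":
    print(max_theoretical_yield(substrate_racemizes=True))
    print(enantiomeric_excess(97.5, 2.5))
```

For example, a product mixture containing 97.5 parts of one enantiomer and 2.5 parts of the other corresponds to an ee of 95 percent.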

[000125] This invention is directed to the nitrilases which have been discovered and
isolated from naturally occurring sources. This invention is also directed to evolving novel
genes and gene pathways from diverse and extreme environmental sources. In an effort to
develop the most extensive assortment of enzymes available, DNA was extracted directly
from samples that have been collected from varying habitats around the globe. From these
efforts, the largest collection of environmental genetic libraries in the world was developed.
Through extensive high-throughput screening of these libraries, 192 new sequence-unique
nitrilase enzymes have been discovered to date. Previous to this invention, fewer than 20
microbial-derived nitrilases had been reported in the literature and public databases.

[000126] Biocatalysts, such as nitrilases, play an important role in catalyzing metabolic
reactions in living organisms. In addition, biocatalysts have found applications in the
chemical industry, where they can perform many different reactions. Some examples of the
advantages of the use of nitrilases are that they provide high enantio-, chemo- and
regio-selectivity; they function under mild reaction conditions; they provide direct access to
products with minimal protection; they have high catalytic efficiencies; they produce
reduced waste compared with the chemical alternatives; they are easily immobilized as
enzymes or cells; they are recoverable, recyclable and are capable of being manipulated via
molecular biological techniques; they can be regenerated in whole cell processes; they are
tolerant to organic solvents; and importantly, they can be evolved or optimized. Optimized
nitrilases are presented herein as working examples of the invention.

[000127] Nitrilases catalyze the hydrolysis of nitrile moieties generating the corresponding
carboxylic acid. Conventional chemical hydrolysis of nitriles requires strong acid or base and
high temperature. However, one advantage of the invention is that nitrilases are provided
which perform this reaction under mild conditions. Wide ranges of nitrile substrates can be
transformed by nitrilases with high enantio-, chemo- and regio-selectivity.

Table 1 - Some characteristics of Nitrilases of the Invention

Previously Discovered Nitrilases: Limitations | New Nitrilases: New Features | Benefits
< 20 reported | >180 newly discovered | Access to a wider substrate range
Homologous | Unique nitrilases, many with little homology to previously known nitrilases |
Narrow substrate activity spectrum | Broad substrate activity spectrum |
Very few shown to be enantioselective | Enantioselective; both enantiomers accessible | Product with high enantiomeric excess and minimal waste production
Limited stability profile | Stable in a variety of conditions | Potential use in a wide range of process conditions
Inconsistent supply | Consistent supply | Reliable source of product
Not applicable | Amenable to optimization | Good source material leads to better product

[000128] Dynamic Kinetic Resolution: The use of the nitrilases allows discrimination
between two rapidly equilibrating enantiomers to give a single product in 100% theoretical
yield. Nitrilases are utilized for dynamic resolution of key cyanohydrins and aminonitriles to
produce enantiomerically pure α-carboxylic and α-amino acids. Newly discovered nitrilases
disclosed herein yield products with >95% enantiomeric excess (ee) and >95% yield.
The nitrilases perform this transformation efficiently under mild conditions in aqueous
solution or in the presence of organic solvent.

[Reaction scheme not reproduced; surviving panel label: α-Amino Acids]

[000129] These products shown above also include the opposite enantiomers, although
they are not shown. In one aspect, the invention provides an isolated nucleic acid having a
sequence as set forth in any one of the Group A nucleic acid sequences, having a sequence
substantially identical thereto, or having a sequence complementary thereto.

[000130] In another aspect, the invention provides an isolated nucleic acid including at least
20 consecutive nucleotides identical to a portion of a nucleotide sequence as set forth in the
Group A nucleic acid sequences, having a sequence substantially identical thereto, or having
a sequence complementary thereto.

[000131] In another aspect, the invention provides an isolated nucleic acid encoding a
polypeptide having a sequence as set forth in the Group B amino acid sequences, or having a
sequence substantially identical thereto.

[000132] In another aspect, the invention provides an isolated nucleic acid encoding a
polypeptide having at least 10 consecutive amino acids identical to a portion of a sequence as
set forth in the Group B amino acid sequences, or having a sequence substantially identical
thereto.

[000133] In yet another aspect, the invention provides a substantially purified polypeptide
comprising consecutive amino acid residues having a sequence as set forth in the Group B
amino acid sequences, or having a sequence substantially identical thereto.

[000134] In another aspect, the invention provides an isolated antibody that specifically
binds to a polypeptide of the invention. The invention also provides for a fragment of the
antibody which retains the ability to specifically bind the polypeptide.

[000135] In another aspect, the invention provides a method of producing a polypeptide
having a sequence as set forth in the Group B amino acid sequences, and sequences
substantially identical thereto. The method includes introducing a nucleic acid encoding the
polypeptide into a host cell, wherein the nucleic acid is operably joined to a promoter, and
culturing the host cell under conditions that allow expression of the nucleic acid.

[000136] In another aspect, the invention provides a method of producing a polypeptide
having at least 10 consecutive amino acids from a sequence as set forth in the Group B amino
acid sequences, and sequences substantially identical thereto. The method includes
introducing a nucleic acid encoding the polypeptide into a host cell, wherein the nucleic acid
is operably joined to a promoter, and culturing the host cell under conditions that allow
expression of the nucleic acid, thereby producing the polypeptide.

[000137] In another aspect, the invention provides a method of generating a variant of a
nitrilase, including choosing a nucleic acid sequence as set forth in the Group A nucleic acid
sequences, and changing one or more nucleotides in the sequence to another nucleotide,
deleting one or more nucleotides in the sequence, or adding one or more nucleotides to the
sequence.
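
As a purely illustrative sketch, the three kinds of sequence change named above (changing, deleting, or adding nucleotides) can be expressed as string operations on a nucleotide sequence. The function names, the 0-based position convention, and the short example sequence are assumptions made for this sketch and are not taken from the Group A sequences.

```python
VALID_BASES = set("ACGT")


def _check_bases(bases):
    """Reject empty input or characters that are not A, C, G or T."""
    if not bases or set(bases.upper()) - VALID_BASES:
        raise ValueError("bases must be a non-empty string over A, C, G, T")


def substitute(seq, position, new_base):
    """Change the nucleotide at a 0-based position to another nucleotide."""
    _check_bases(new_base)
    if len(new_base) != 1:
        raise ValueError("substitute expects exactly one base")
    if not 0 <= position < len(seq):
        raise IndexError("position outside sequence")
    return seq[:position] + new_base + seq[position + 1:]


def delete(seq, position, count=1):
    """Delete one or more nucleotides starting at a 0-based position."""
    if count < 1 or position < 0 or position + count > len(seq):
        raise IndexError("deletion outside sequence")
    return seq[:position] + seq[position + count:]


def insert(seq, position, bases):
    """Add one or more nucleotides before a 0-based position."""
    _check_bases(bases)
    if not 0 <= position <= len(seq):
        raise IndexError("position outside sequence")
    return seq[:position] + bases + seq[position:]


if __name__ == "__main__":
    template = "ATGC"  # hypothetical stand-in for a chosen sequence
    print(substitute(template, 1, "G"), delete(template, 1, 2), insert(template, 2, "TT"))
```

Whether a variant produced this way retains nitrilase activity is not something the sketch can determine; that is established by the assays described in the following paragraph.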

[000138] In another aspect, the invention provides assays for identifying functional variants
of the Group B amino acid sequences that retain the enzymatic function of the polypeptides
of the Group B amino acid sequences. The assays include contacting a polypeptide
comprising consecutive amino acid residues having a sequence identical to a sequence of the
Group B amino acid sequences or a portion thereof, having a sequence substantially identical
to a sequence of the Group B amino acid sequences or a portion thereof, or having a sequence
which is a variant of a sequence of the Group B amino acid sequences that retains nitrilase
activity, with a substrate molecule under conditions which allow the polypeptide to function,
and detecting either a decrease in the level of substrate or an increase in the level of a specific
reaction product of the reaction between the polypeptide and the substrate, thereby
identifying a functional variant of such sequences.

Modification of Polypeptides of the Invention

[000139] Enzymes are highly selective catalysts. Their hallmark is the ability to catalyze
reactions with exquisite stereo-selectivity, regio-selectivity, and chemo-selectivity that is
unparalleled in conventional synthetic chemistry. Moreover, enzymes are remarkably
versatile. They can be tailored to function in organic solvents, operate at extreme pHs (for
example, acidic or basic conditions), extreme temperatures (for example, high temperatures
and low temperatures), and extreme salinity levels (for example, high salinity and low salinity),
and catalyze reactions with compounds that can be structurally unrelated to their natural,
physiological substrates except for the enzymatic active site.

[000140] The invention provides methods for modifying polypeptides having nitrilase
activity or polynucleotides encoding such polypeptides in order to obtain new polypeptides
which retain nitrilase activity but which are improved with respect to some desired
characteristic. Such improvements can include the ability to function (i.e., exhibit nitrilase
activity) in organic solvents, operate at extreme or uncharacteristic pHs, operate at extreme or
uncharacteristic temperatures, operate at extreme or uncharacteristic salinity levels, catalyze
reactions with different substrates, etc.

[000141] The present invention is directed to methods of using nitrilases so as to exploit the
unique catalytic properties of these enzymes. Whereas the use of biocatalysts (i.e., purified or
crude enzymes) in chemical transformations normally requires the identification of a
particular biocatalyst that reacts with a specific starting compound, the present invention uses
selected biocatalysts and reaction conditions that are specific for functional groups that are
present in many starting compounds. Each biocatalyst is specific for one functional group, or
several related functional groups, and can react with many starting compounds containing
this functional group.

[000142] Enzymes react at specific sites within a starting compound without affecting the
rest of the molecule, a process which is very difficult to achieve using traditional chemical
methods. This high degree of specificity provides the means to identify a single active
compound within a library of compounds. The library is characterized by the series of
biocatalytic reactions used to produce it, a so-called "biosynthetic history." Screening the
library for biological activities and tracing the biosynthetic history identifies the specific
reaction sequence producing the active compound. The reaction sequence is repeated and the
structure of the synthesized compound determined. This mode of identification, unlike other
synthesis and screening approaches, does not require immobilization technologies, and
compounds can be synthesized and tested free in solution using virtually any type of
screening assay. It is important to note that the high degree of specificity of enzyme
reactions on functional groups allows for the "tracking" of specific enzymatic reactions that
make up the biocatalytically produced library. (For further teachings on modification of
molecules, including small molecules, see PCT Application No. PCT/US94/09174, herein
incorporated by reference in its entirety).

[000143] In one exemplification, the invention provides for the chimerization of a family of
related nitrilase genes and their encoded family of related products. Thus according to this
aspect of the invention, the sequences of a plurality of nitrilase nucleic acids (e.g., the Group
A nucleic acids) serve as nitrilase "templates" which are aligned using a sequence comparison
algorithm such as those described above. One or more demarcation points are then identified
in the aligned template sequences, which are located at one or more areas of homology. The
demarcation points can be used to delineate the boundaries of nucleic acid building blocks,
which are used to generate chimeric nitrilases. Thus, the demarcation points identified and
selected in the nitrilase template molecules serve as potential chimerization points in the
assembly of the chimeric nitrilase molecules.

[000144] Typically, a useful demarcation point is an area of local identity between at least
two progenitor templates, but preferably the demarcation point is an area of identity that is
shared by at least half of the templates, at least two thirds of the templates, at least three
fourths of the templates, or nearly all of the templates.
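
By way of non-limiting illustration only, the selection of candidate demarcation points from aligned templates might be sketched as follows. The function name, the fixed window length, and the `min_fraction` threshold are hypothetical choices made for this sketch and are not taken from the disclosure; the toy templates are not nitrilase sequences.

```python
from collections import Counter

def candidate_demarcation_points(aligned, window=4, min_fraction=0.5):
    """Return (start, shared_sequence, fraction) for each gap-free window that
    is identical in at least two templates and in at least `min_fraction`
    of all aligned templates. All names here are illustrative assumptions."""
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("templates must be aligned to the same length")
    hits = []
    for start in range(length - window + 1):
        # Collect the window from every template, skipping windows with gaps.
        windows = [seq[start:start + window] for seq in aligned]
        counts = Counter(w for w in windows if "-" not in w)
        if not counts:
            continue
        shared, n = counts.most_common(1)[0]
        fraction = n / len(aligned)
        if n >= 2 and fraction >= min_fraction:
            hits.append((start, shared, fraction))
    return hits

# Three toy aligned templates (hypothetical data).
templates = [
    "ATGGCTAAGGTTCCA",
    "ATGGCAAAGGTTCCG",
    "ATGTCTAAGGTACCA",
]

# Windows shared by all templates.
shared_by_all = candidate_demarcation_points(templates, window=4, min_fraction=1.0)
```

Lowering `min_fraction` to 0.5 or 0.75 in this sketch corresponds to the "at least half" and "at least three fourths" preferences stated above.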

[000145] The building blocks, which are defined by the demarcation points, can then be
mixed (either literally, in solution, or theoretically, on paper or in a computer) and
reassembled to form chimeric nitrilase genes. In one aspect, the gene reassembly process is
performed exhaustively in order to generate an exhaustive library of all possible
combinations. In other words, all possible ordered combinations of the nucleic acid building
blocks are represented in the set of finalized chimeric nucleic acid molecules. At the same
time, however, the order of assembly of each building block in the 5' to 3' direction in each
combination is designed to reflect the order in the templates, and to reduce the production of
unwanted, inoperative products.
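
As a non-limiting sketch of the "theoretical" (in a computer) reassembly described above, the exhaustive library can be enumerated by taking, at each block position, the corresponding building block from any one template while keeping the positions in template order. The template labels, the block sequences, and the function name below are hypothetical and serve only to illustrate the counting.

```python
from itertools import product

def exhaustive_chimeras(blocks_by_position):
    """Yield (origin, sequence) for every ordered combination of blocks.

    blocks_by_position[i] maps a template label to that template's building
    block at position i; blocks are joined in 5' to 3' template order."""
    labels = [sorted(blocks) for blocks in blocks_by_position]
    for origin in product(*labels):
        sequence = "".join(
            blocks_by_position[i][label] for i, label in enumerate(origin)
        )
        yield origin, sequence

# Three hypothetical templates (A, B, C), each cut into three building blocks.
blocks = [
    {"A": "ATGGCT", "B": "ATGGCA", "C": "ATGTCT"},
    {"A": "AAGGTT", "B": "AAGGTA", "C": "AAGGTC"},
    {"A": "CCATAA", "B": "CCGTAA", "C": "CCTTAA"},
]

library = dict(exhaustive_chimeras(blocks))
# Combinations whose blocks all come from one template are the parents.
parents = [origin for origin in library if len(set(origin)) == 1]
```

With three templates and three block positions, this sketch yields 3 x 3 x 3 = 27 ordered combinations, three of which simply regenerate the parental templates.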

[000146] In some aspects, the gene reassembly process is performed systematically, in
order to generate a compartmentalized library with compartments that can be screened 
systematically, e.g., one by one. In other words, the invention provides that, through the 
selective and judicious use of specific nucleic acid building blocks, coupled with the selective 
and judicious use of sequentially stepped assembly reactions, an experimental design can be 
achieved where specific sets of chimeric products are made in each of several reaction 
vessels. This allows a systematic examination and screening procedure to be performed. 
Thus, it allows a potentially very large number of chimeric molecules to be examined 
systematically in smaller groups. 

[000147] In some aspects, the synthetic nature of the step in which the building blocks are 
generated or reassembled allows the design and introduction of sequences of nucleotides 
(e.g., codons or introns or regulatory sequences) that can later be optionally removed in an in 
vitro process (e.g., by mutagenesis) or in an in vivo process (e.g., by utilizing the gene 
splicing ability of a host organism). The introduction of these nucleotides may be desirable 
for many reasons, including the potential benefit of creating a useful demarcation point. 

[000148] The synthetic gene reassembly method of the invention utilizes a plurality of 
nucleic acid building blocks, each of which has two ligatable ends. Some examples of the 
two ligatable ends on each nucleic acid building block include, but are not limited to, two
blunt ends, or one blunt end and one overhang, or two overhangs. In a further, non-limiting 
example, the overhang can include one base pair, 2 base pairs, 3 base pairs, 4 base pairs or 
more. 

[000149] A double-stranded nucleic acid building block can be of variable size. Preferred 
sizes for building blocks range from about 1 base pair (bp) (not including any overhangs) to 
about 100,000 base pairs (not including any overhangs). Other preferred size ranges are also 
provided, which have lower limits of from about 1 bp to about 10,000 bp (including every 
integer value in between), and upper limits of from about 2 bp to about 100,000 bp (including 
every integer value in between). 

[000150] According to one aspect, a double-stranded nucleic acid building block is 
generated by first generating two single stranded nucleic acids and allowing them to anneal to 
form a double-stranded nucleic acid building block. The two strands of a double-stranded 
nucleic acid building block may be complementary at every nucleotide apart from any that
form an overhang, thus containing no mismatches apart from any overhang(s).
Alternatively, the two strands of a double-stranded nucleic acid building block can be
complementary at fewer than every nucleotide, apart from any overhang(s). In particular,
mismatches between the strands can be used to introduce codon degeneracy using methods
such as the site-saturation mutagenesis described herein.

[000151] In vivo shuffling of molecules is also useful in providing variants and can be 
performed utilizing the natural property of cells to recombine multimers. While 
recombination in vivo has provided the major natural route to molecular diversity, genetic 
recombination remains a relatively complex process that involves (1) the recognition of 
homologies; (2) strand cleavage, strand invasion, and metabolic steps leading to the
production of recombinant chiasma; and finally (3) the resolution of chiasma into discrete
recombined molecules. The formation of the chiasma requires the recognition of 
homologous sequences. 

[000152] Thus, the invention includes a method for producing a chimeric or recombinant 
polynucleotide from at least a first polynucleotide and a second polynucleotide in vivo. The
invention can be used to produce a recombinant polynucleotide by introducing at least a first
polynucleotide and a second polynucleotide which share at least one region of partial
sequence homology (e.g., the Group A nucleic acid sequences, and combinations thereof)
into a suitable host cell. The regions of partial sequence homology promote processes which
result in sequence reorganization producing a recombinant polynucleotide. Such hybrid
polynucleotides can result from intermolecular recombination events which promote 
sequence integration between DNA molecules. In addition, such hybrid polynucleotides can 
result from intramolecular reductive reassortment processes which utilize repeated sequences 
to alter a nucleotide sequence within a DNA molecule. 

[000153] The invention provides a means for generating recombinant polynucleotides
which encode biologically active variant polypeptides (e.g., a nitrilase variant). For example,
a polynucleotide may encode a particular enzyme from one microorganism. An enzyme
encoded by a first polynucleotide from one organism can, for example, function effectively
under a particular environmental condition, e.g., high salinity. An enzyme encoded by a
second polynucleotide from a different organism can function effectively under a different
environmental condition, such as extremely high temperature. A recombined polynucleotide
containing sequences from the first and second original polynucleotides encodes a variant
enzyme which exhibits characteristics of both enzymes encoded by the original
polynucleotides. Thus, the enzyme encoded by the recombined polynucleotide can function
effectively under environmental conditions shared by each of the enzymes encoded by the
first and second polynucleotides, e.g., high salinity and extreme temperatures.

[000154] A variant polypeptide can exhibit specialized enzyme activity not displayed in the
original enzymes. For example, following recombination and/or reductive reassortment of
polynucleotides encoding nitrilase activity, the resulting variant polypeptide encoded by a
recombined polynucleotide can be screened for specialized nitrilase activity obtained from
each of the original enzymes, i.e., the temperature or pH at which the nitrilase functions.
The original polynucleotides may be isolated from individual organisms
("isolates"), collections of organisms that have been grown in defined media ("enrichment
cultures"), or uncultivated organisms ("environmental samples"). The use of a
culture-independent approach to derive polynucleotides encoding novel bioactivities from
environmental samples is most preferable since it allows one to access untapped resources of
biodiversity. The microorganisms from which the polynucleotide may be prepared include
prokaryotic microorganisms, such as Xanthobacter, Eubacteria and Archaebacteria, and
lower eukaryotic microorganisms such as fungi, some algae and protozoa. Polynucleotides
may be isolated from environmental samples, in which case the nucleic acid may be recovered
without culturing of an organism or recovered from one or more cultured organisms. In one
aspect, such microorganisms may be extremophiles, such as hyperthermophiles,
psychrophiles, psychrotrophs, halophiles, barophiles and acidophiles. Polynucleotides
encoding enzymes isolated from extremophilic microorganisms are particularly preferred.
Such enzymes may function at temperatures above 100°C in terrestrial hot springs and deep
sea thermal vents, at temperatures below 0°C in arctic waters, in the saturated salt
environment of the Dead Sea, at pH values around 0 in coal deposits and geothermal
sulfur-rich springs, or at pH values greater than 11 in sewage sludge.

[000155] Examples of mammalian expression systems that can be employed to express 
recombinant proteins include the COS-7, C127, 3T3, CHO, HeLa and BHK cell lines. 
Mammalian expression vectors comprise an origin of replication, a suitable promoter and 
enhancer, and also any necessary ribosome binding sites, polyadenylation site, splice donor
and acceptor sites, transcriptional termination sequences, and 5' flanking nontranscribed
sequences. DNA sequences derived from the SV40 splice and polyadenylation sites may be
used to provide the required nontranscribed genetic elements. U.S. Patent No. 6,054,267 is
hereby incorporated by reference in its entirety.



[000156] Host cells containing the polynucleotides of interest can be cultured in 
conventional nutrient media modified as appropriate for activating promoters, selecting 
transformants or amplifying genes. The culture conditions, such as temperature, pH and the
like, are those previously used with the host cell selected for expression, and will be apparent
to the ordinarily skilled artisan. Clones identified as having a desired enzyme
activity or other property may then be sequenced to identify the recombinant polynucleotide
sequence encoding the enzyme having the desired activity or property.

[000157] In one aspect, the invention provides for the isolated nitrilases as either an isolated
nucleic acid or an isolated polypeptide wherein the nucleic acid or the polypeptide was
prepared by recovering DNA from a DNA population derived from at least one uncultivated
microorganism, and transforming a host with recovered DNA to produce a library of clones
which is screened for the specified protein, e.g., nitrilase activity. U.S. Patent No. 6,280,926,
Short, provides descriptions of such methods and is hereby incorporated by reference in its
entirety for all purposes.

[000158] Therefore, in one aspect, the invention relates to a method for producing a
biologically active recombinant nitrilase polypeptide and screening such a polypeptide for
desired activity or property by:

1) introducing at least a first nitrilase polynucleotide and a second nitrilase
polynucleotide, said at least first nitrilase polynucleotide and second nitrilase polynucleotide
sharing at least one region of sequence homology, into a suitable host cell;

2) growing the host cell under conditions which promote sequence reorganization
resulting in a recombinant nitrilase polynucleotide;

3) expressing a recombinant nitrilase polypeptide encoded by the recombinant
nitrilase polynucleotide;

4) screening the recombinant nitrilase polypeptide for the desired activity or
property; and

5) isolating the recombinant nitrilase polynucleotide encoding the recombinant
nitrilase polypeptide.

[000159] Examples of vectors which may be used include viral particles, baculovirus,
phage, plasmids, phagemids, cosmids, fosmids, bacterial artificial chromosomes, viral DNA
(e.g., vaccinia, adenovirus, fowlpox virus, pseudorabies and derivatives of SV40), P1-based
artificial chromosomes, yeast plasmids, yeast artificial chromosomes, and any other vectors
specific for the hosts of interest (e.g., Bacillus, Aspergillus and yeast). Large numbers of
suitable vectors are known to those of skill in the art, and are commercially available.
Examples of bacterial vectors include pQE vectors (Qiagen, Valencia, CA); pBluescript
plasmids, pNH vectors, and lambda-ZAP vectors (Stratagene, La Jolla, CA); and pTRC99a,
pKK223-3, pDR540, and pRIT2T vectors (Pharmacia, Peapack, NJ). Examples of eukaryotic
vectors include pXT1 and pSG5 vectors (Stratagene, La Jolla, CA); and pSVK3, pBPV,
pMSG, and pSVLSV40 vectors (Pharmacia, Peapack, NJ). However, any other plasmid or
other vector may be used so long as it is replicable and viable in the host.

[000160] A preferred type of vector for use in the present invention contains an f-factor (or
fertility factor) origin of replication. The f-factor in E. coli is a plasmid which effects high
frequency transfer of itself during conjugation and less frequent transfer of the bacterial
chromosome itself. A particularly preferred aspect is to use cloning vectors referred to as
"fosmids" or bacterial artificial chromosome (BAC) vectors. These are derived from the E. coli
f-factor, which is able to stably integrate large segments of genomic DNA.

[000161] The DNA sequence in the expression vector is operably joined to appropriate
expression control sequences, including a promoter, to direct RNA synthesis. Useful
bacterial promoters include lacI, lacZ, T3, T7, gpt, lambda PR, PL and trp. Useful eukaryotic
promoters include CMV immediate early, HSV thymidine kinase, early and late SV40, LTRs
from retrovirus, and mouse metallothionein-I. Selection of the appropriate vector and
promoter is well within the level of ordinary skill in the art. The expression vector also
contains a ribosome binding site for translation initiation and a transcription terminator. The
vector may also include appropriate sequences for amplifying expression. Promoter regions
can be selected from any desired gene using CAT (chloramphenicol acetyltransferase) vectors or
other vectors with selectable markers.

[000162] In addition, the expression vectors can contain one or more selectable marker
genes to provide a phenotypic trait for selection of transformed host cells. Useful selectable
markers include dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or
tetracycline or ampicillin resistance in E. coli.

[000163] The vector may be introduced into the host cells using any of a variety of
techniques, including transformation, transfection, transduction, viral infection, gene guns, or
Ti-mediated gene transfer. Particular methods include calcium phosphate transfection,
DEAE-Dextran mediated transfection, lipofection, or electroporation.

[000164] Reductive Reassortment - In another aspect, variant nitrilase polynucleotides can
be generated by the process of reductive reassortment. Whereas recombination is an
"inter-molecular" process which, in bacteria, is generally viewed as a "recA-dependent"
phenomenon, the process of "reductive reassortment" occurs by an "intra-molecular",
recA-independent process. In this aspect, the invention can rely on the ability of cells to mediate
reductive processes to decrease the complexity of quasi-repeated sequences in the cell by
deletion. The method involves the generation of constructs containing consecutive repeated
or quasi-repeated sequences (original encoding sequences), the insertion of these sequences
into an appropriate vector, and the subsequent introduction of the vector into an appropriate
host cell. The reassortment of the individual molecular identities occurs by combinatorial
processes between the consecutive sequences in the construct possessing regions of
homology, or between quasi-repeated units. The reassortment process recombines and/or
reduces the complexity and extent of the repeated sequences, and results in the production of
novel molecular species. Various treatments may be applied to enhance the rate of
reassortment, such as ultra-violet light or DNA damaging chemicals. In addition, host cell
lines displaying enhanced levels of "genetic instability" can be used.

[000165] Repeated Sequences - Repeated or "quasi-repeated" sequences play a role in
genetic instability. In the present invention, "quasi-repeats" are repeats that are not identical
in structure but, rather, represent an array of consecutive sequences which have a high degree
of similarity or identity. The reductive reassortment or deletion process in the cell
reduces the complexity of the resulting construct by deleting sequences between positions
within quasi-repeated sequences. Because the deletion (and potentially insertion) events can
occur virtually anywhere within the quasi-repetitive units, these sequences provide a large
repertoire of potential variants.

[000166] When the quasi-repeated sequences are all ligated in the same orientation, for
instance head-to-tail or vice versa, the endpoints of a deletion are, for the most part, equally
likely to occur anywhere within the quasi-repeated sequences. In contrast, when the units are
presented head-to-head or tail-to-tail, the inverted quasi-repeated sequences can form a
duplex which delineates the endpoints of the adjacent units and thereby favors deletion of
discrete units. Therefore, it is preferable in the present invention that the quasi-repeated
sequences are joined in the same orientation because random orientation of quasi-repeated
sequences will result in the loss of reassortment efficiency, while consistent orientation of the
sequences will offer the highest efficiency. Nonetheless, although having fewer of the
contiguous sequences in the same orientation decreases the efficiency of reductive
reassortment, it may still provide sufficient variation for the effective recovery of novel
molecules.

[000167] Sequences can be assembled in a head-to-tail orientation using any of a variety of
methods, including the following:

a) Primers can be utilized that include a poly-A head and poly-T tail which, when
made single-stranded, would provide orientation. This is accomplished by having the first
few bases of the primers made from RNA and hence easily removed by RNase H.

b) Primers can be utilized that include unique restriction cleavage sites. Multiple sites, a
battery of unique sequences, and repeated synthesis and ligation steps would be required.

c) The inner few bases of the primer can be thiolated and an exonuclease used to
produce properly tailed molecules.

[000168] The recovery of the reassorted sequences relies on the identification of cloning
vectors with a reduced repetitive index (RI). The reassorted coding sequences can then be
recovered by amplification. The products are recloned and expressed. The recovery of
cloning vectors with reduced RI can be effected by:

1) The use of vectors only stably maintained when the construct is reduced in
complexity.

2) The physical recovery of shortened vectors by physical procedures. In this
case, the cloning vector would be recovered using standard plasmid isolation procedures and
then size-fractionated using standard procedures (e.g., agarose gel or column with a low
molecular weight cut off).

3) The recovery of vectors containing interrupted genes, which can be selected for when
insert size decreases.

4) The use of direct selection techniques wherein an expression vector is used
and the appropriate selection is carried out.

[000169] Coding sequences from related organisms may demonstrate a high degree of
homology but nonetheless encode quite diverse protein products. These types of sequences
are particularly useful in the present invention as quasi-repeats. However, while the
examples illustrated below demonstrate the reassortment of coding sequences with a high
degree of identity (quasi-repeats), this process is not limited to nearly identical repeats.

[000170] The following example demonstrates a method of the invention. Quasi-repetitive 
coding sequences derived from three different species are obtained. Each sequence encodes a 
protein with a distinct set of properties. Each of the sequences differs by one or more base 
pairs at unique positions in the sequences which are designated "A", "B" and "C". The
quasi-repeated sequences are separately or collectively amplified and ligated into random
assemblies such that all possible permutations and combinations are available in the 
population of ligated molecules. The number of quasi-repeat units can be controlled by the 
assembly conditions. The average number of quasi-repeated units in a construct is defined as 
the repetitive index (RI). 

[000171] Once formed, the constructs can be size-fractionated on an agarose gel according
to published protocols, inserted into a cloning vector, and transfected into an appropriate host
cell. The cells can then be propagated to allow reductive reassortment to occur. The rate of
the reductive reassortment process may be stimulated by the introduction of DNA damage if
desired. Whether the reduction in RI is mediated by deletion formation between repeated
sequences by an "intra-molecular" mechanism, or mediated by recombination-like events
through "inter-molecular" mechanisms is immaterial. The end result is a reassortment of the
molecules into all possible combinations.

[000172] In another aspect, prior to or during recombination or reassortment,
polynucleotides of the invention or polynucleotides generated by the methods described
herein can be subjected to agents or processes which promote the introduction of mutations
into the original polynucleotides. The introduction of such mutations would increase the
diversity of resulting hybrid polynucleotides and polypeptides encoded therefrom. The
agents or processes which promote mutagenesis include, but are not limited to: (+)-CC-1065,
or a synthetic analog such as (+)-CC-1065-(N3-adenine) (Sun et al. (1992), Biochemistry
31(10):2822-9); an N-acetylated or deacetylated 4'-fluoro-4-aminobiphenyl adduct capable of
inhibiting DNA synthesis (see, for example, van de Poll et al. (1992), Carcinogenesis
13(5):751-8); or an N-acetylated or deacetylated 4-aminobiphenyl adduct capable of inhibiting
DNA synthesis (see also, van de Poll et al. (1992), supra); trivalent chromium, a trivalent
chromium salt, a polycyclic aromatic hydrocarbon ("PAH") DNA adduct capable of
inhibiting DNA replication, such as 7-bromomethyl-benz[a]anthracene ("BMA"),
tris(2,3-dibromopropyl)phosphate ("Tris-BP"), 1,2-dibromo-3-chloropropane ("DBCP"),
2-bromoacrolein (2BA), benzo[a]pyrene-7,8-dihydrodiol-9-10-epoxide ("BPDE"), a
platinum(II) halogen salt, N-hydroxy-2-amino-3-methylimidazo[4,5-f]-quinoline
("N-hydroxy-IQ"), and N-hydroxy-2-amino-1-methyl-6-phenylimidazo[4,5-f]-pyridine
("N-hydroxy-PhIP"). Especially preferred means for slowing or halting PCR amplification
consist of UV light, (+)-CC-1065 and (+)-CC-1065-(N3-adenine). Particularly encompassed
means are DNA adducts or polynucleotides comprising the DNA adducts from the
polynucleotides or polynucleotides pool, which can be released or removed by a process
including heating the solution comprising the polynucleotides prior to further processing.

[000173] GSSM™ - The invention also provides for the use of codon primers containing a
degenerate N,N,G/T sequence to introduce point mutations into a polynucleotide, so as to
generate a set of progeny polypeptides in which a full range of single amino acid
substitutions is represented at each amino acid position, a method referred to as gene
site-saturated mutagenesis (GSSM™). The oligos used are comprised contiguously of a first
homologous sequence, a degenerate N,N,G/T sequence, and possibly a second homologous
sequence. The progeny translational products from the use of such oligos include all possible
amino acid changes at each amino acid site along the polypeptide, because the degeneracy of
the N,N,G/T sequence includes codons for all 20 amino acids.

[000174] In one aspect, one such degenerate oligo (comprising one degenerate N,N,G/T
cassette) is used for subjecting each original codon in a parental polynucleotide template to a
full range of codon substitutions. In another aspect, at least two degenerate N,N,G/T
cassettes are used - either in the same oligo or not - for subjecting at least two original codons
in a parental polynucleotide template to a full range of codon substitutions. Thus, more than
one N,N,G/T sequence can be contained in one oligo to introduce amino acid mutations at
more than one site. This plurality of N,N,G/T sequences can be directly contiguous, or
separated by one or more additional nucleotide sequences. In another aspect, oligos
serviceable for introducing additions and deletions can be used either alone or in combination
with the codons containing an N,N,G/T sequence, to introduce any combination or
permutation of amino acid additions, deletions, and/or substitutions.

[000175] In a particular exemplification, it is possible to simultaneously mutagenize two or
more contiguous amino acid positions using an oligo that contains contiguous N,N,G/T
triplets, i.e., a degenerate (N,N,G/T)n sequence.

[000176] In another aspect, the present invention provides for the use of degenerate
cassettes having less degeneracy than the N,N,G/T sequence. For example, it may be
desirable in some instances to use a degenerate triplet sequence comprised of only one N,
where said N can be in the first, second or third position of the triplet. Any other bases
including any combinations and permutations thereof can be used in the remaining two
positions of the triplet. Alternatively, it may be desirable in some instances to use a
degenerate N,N,N triplet sequence, or an N,N,G/C triplet sequence.

[000177] It is appreciated, however, that the use of a degenerate triplet (such as N,N,G/T or
an N,N,G/C triplet sequence) as disclosed in the instant invention is advantageous for several
reasons. In one aspect, this invention provides a means to systematically and fairly easily
generate the substitution of the full range of the 20 possible amino acids into each and every
amino acid position in a polypeptide. Thus, for a 100 amino acid polypeptide, the invention
provides a way to systematically and fairly easily generate 2000 distinct species (i.e., 20
possible amino acids per position times 100 amino acid positions). It is appreciated that there
is provided, through the use of an oligo containing a degenerate N,N,G/T or an N,N,G/C
triplet sequence, 32 individual sequences that code for the 20 possible amino acids. Thus, in
a reaction vessel in which a parental polynucleotide sequence is subjected to saturation
mutagenesis using one such oligo, there are generated 32 distinct progeny polynucleotides
encoding 20 distinct polypeptides. In contrast, the use of a non-degenerate oligo in
site-directed mutagenesis leads to only one progeny polypeptide product per reaction vessel.
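
The codon counts stated above can be checked, by way of illustration only, with the following sketch, which expands the N,N,G/T and N,N,G/C triplets against the standard genetic code. The helper names are hypothetical. The sketch assumes the standard code; under that code, one of the 32 triplets in each set is the amber stop codon TAG, so that 31 of the 32 sequences encode amino acids and all 20 amino acids are represented.

```python
from itertools import product

# Standard genetic code, written in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def expand_triplet(first, second, third):
    """List every codon allowed by the bases permitted at each position."""
    return ["".join(codon) for codon in product(first, second, third)]

def summarize(codons):
    """Return (number of codons, number of amino acids encoded, stop codons)."""
    amino_acids = {CODON_TABLE[c] for c in codons} - {"*"}
    stops = [c for c in codons if CODON_TABLE[c] == "*"]
    return len(codons), len(amino_acids), stops

nng_t = expand_triplet("ACGT", "ACGT", "GT")  # N,N,G/T
nng_c = expand_triplet("ACGT", "ACGT", "GC")  # N,N,G/C

# 20 substitutions at each of 100 positions, as in the example above.
species_for_100_residues = 20 * 100
```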

[000178] This invention also provides for the use of nondegenerate oligonucleotides, which
can optionally be used in combination with degenerate primers disclosed. It is appreciated
that in some situations, it is advantageous to use nondegenerate oligos to generate specific
point mutations in a working polynucleotide. This provides a means to generate specific
silent point mutations, point mutations leading to corresponding amino acid changes, and
point mutations that cause the generation of stop codons and the corresponding expression of
polypeptide fragments.

[000179] Thus, in one aspect, each saturation mutagenesis reaction vessel contains
polynucleotides encoding at least 20 progeny polypeptide molecules such that all 20 amino
acids are represented at the one specific amino acid position corresponding to the codon
position mutagenized in the parental polynucleotide. The 32-fold degenerate progeny
polypeptides generated from each saturation mutagenesis reaction vessel can be subjected to
clonal amplification (e.g., cloned into a suitable E. coli host using an expression vector) and
subjected to expression screening. When an individual progeny polypeptide is identified by
screening to display a favorable change in property (when compared to the parental
polypeptide), it can be sequenced to identify the correspondingly favorable amino acid
substitution contained therein.

[000180] It is appreciated that upon mutagenizing each and every amino acid position in a 
parental polypeptide using saturation mutagenesis as disclosed herein, favorable amino acid 
changes may be identified at more than one amino acid position. One or more new progeny 
molecules can be generated that contain a combination of all or part of these favorable amino
acid substitutions. For example, if 2 specific favorable amino acid changes are identified in
each of 3 amino acid positions in a polypeptide, the permutations include 3 possibilities at
each position (no change from the original amino acid, and each of two favorable changes)
and 3 positions. Thus, there are 3 x 3 x 3 or 27 total possibilities, including 7 that were
previously examined - 6 single point mutations (i.e., 2 at each of three positions) and no
change at any position.
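
The arithmetic of the foregoing example can be illustrated, in a non-limiting manner, by enumerating the combinations directly. The residue positions and substitutions below are hypothetical placeholders, as is the function name.

```python
from itertools import product

def combine_favorable(changes_by_position):
    """Yield every combination of 'no change' or one favorable change per
    position, as a dict mapping position -> substituted amino acid."""
    positions = list(changes_by_position)
    options = [[None] + list(changes_by_position[p]) for p in positions]
    for combo in product(*options):
        yield {p: aa for p, aa in zip(positions, combo) if aa is not None}

# Hypothetical example: 2 favorable changes at each of 3 positions.
favorable = {45: ["A", "G"], 102: ["L", "V"], 187: ["K", "R"]}

variants = list(combine_favorable(favorable))
# Parent (no change) plus the six single point mutants already screened.
previously_examined = [v for v in variants if len(v) <= 1]
# Double and triple mutants not yet examined.
new_combinations = [v for v in variants if len(v) >= 2]
```

In this sketch, 27 variants are enumerated, 7 of which correspond to the previously examined molecules, leaving 20 new combinations.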

[000181] In yet another aspect, site-saturation mutagenesis can be used together with 
shuffling, chimerization, recombination and other mutagenizing processes, along with 
screening. This invention provides for the use of any mutagenizing process(es), including 
saturation mutagenesis, in an iterative manner. In one exemplification, the iterative use of 
any mutagenizing process(es) is used in combination with screening.

[000182] Thus, in a non-limiting exemplification, polynucleotides and polypeptides of the 
invention can be derived by saturation mutagenesis in combination with additional 
mutagenization processes, such as a process in which two or more related polynucleotides are
introduced into a suitable host cell such that a hybrid polynucleotide is generated by
recombination and reductive reassortment.


[000183] In addition to performing mutagenesis along the entire sequence of a gene,
mutagenesis can be used to replace each of any number of bases in a polynucleotide
sequence, wherein the number of bases to be mutagenized can be each integer from about 15
to about 100,000. Thus, instead of mutagenizing every position along a molecule, one can
subject every or a discrete number of bases (e.g., a subset totaling from about 15 to about
100,000) to mutagenesis. In one aspect, a separate nucleotide is used for mutagenizing each
position or group of positions along a polynucleotide sequence. A group of 3 positions to be
mutagenized can be a codon. In one aspect, the mutations are introduced using a mutagenic
primer, containing a heterologous cassette, also referred to as a mutagenic cassette. For
example, cassettes can have from about 1 to about 500 bases. Each nucleotide position in
such heterologous cassettes can be N, A, C, G, T, A/C, A/G, A/T, C/G, C/T, G/T, C/G/T,
A/G/T, A/C/T, A/C/G, or E, where E is any base that is not A, C, G, or T.
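
As a non-limiting sketch, the set of sequences represented by such a cassette can be enumerated by expanding each position specifier into its permitted bases. The slash-separated notation follows the listing above; the function names are hypothetical, and the non-standard base "E" is deliberately not modeled in this sketch.

```python
from itertools import product

STANDARD_BASES = ("A", "C", "G", "T")

def position_bases(spec):
    """Translate one position specifier (e.g. 'N', 'A', 'G/T') into bases."""
    if spec == "N":
        return "ACGT"
    bases = spec.split("/")
    if any(b not in STANDARD_BASES for b in bases):
        # 'E' (any base other than A, C, G, or T) is outside this sketch.
        raise ValueError(f"unsupported position specifier: {spec!r}")
    return "".join(bases)

def expand_cassette(specs):
    """List every concrete sequence encoded by a degenerate cassette."""
    return ["".join(seq) for seq in product(*(position_bases(s) for s in specs))]

# An N,N,G/T triplet expands to 32 sequences; a two-position cassette
# 'A' followed by 'C/G/T' expands to three.
nng_t_cassette = expand_cassette(["N", "N", "G/T"])
small_cassette = expand_cassette(["A", "C/G/T"])
```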

[000184] In a general sense, saturation mutagenesis comprises mutagenizing a complete set
of mutagenic cassettes (for example, each cassette is about 1-500 bases in length) in a defined
polynucleotide sequence to be mutagenized (for example, the sequence to be mutagenized is
from about 15 to about 100,000 bases in length). Thus, a group of mutations (ranging from
about 1 to about 100 mutations) is introduced into each cassette to be mutagenized. A
grouping of mutations to be introduced into one cassette can be the same as or different from a
second grouping of mutations to be introduced into a second cassette during the application
of one round of saturation mutagenesis. Such groupings are exemplified by deletions,
additions, groupings of particular codons, and groupings of particular nucleotide cassettes.

[000185] Defined sequences to be mutagenized include a whole gene, pathway, cDNA, 
entire open reading frame (ORF), promoter, enhancer, repressor/transactivator, origin of 
replication, intron, operator, or any polynucleotide functional group. Generally, a "defined 
sequence" for this purpose may be any polynucleotide that is a 15-base polynucleotide 
sequence, or a polynucleotide sequence of a length between about 15 bases and about 15,000 
bases (this invention specifically names every integer in between). Considerations in 
choosing groupings of codons include types of amino acids encoded by a degenerate 
mutagenic cassette. 

[000186] In a particularly preferred exemplification of a grouping of mutations that can be 
introduced into a mutagenic cassette, this invention specifically provides for degenerate 
codon substitutions (using degenerate oligos) that code for 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 
14, 15, 16, 17, 18, 19, and 20 amino acids at each position, and a library of polypeptides 
encoded thereby. 
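
The relationship between a degenerate codon and the number of amino acids it can encode may be illustrated with a short sketch. The sketch assumes the standard genetic code and the conventional IUPAC one-letter symbols for mixed bases (for example, K for G/T and N for A/C/G/T); neither the symbols nor the function names `expand` and `encoded_amino_acids` come from the text above, and they are used here for illustration only.

```python
from itertools import product

# IUPAC symbols for mixed nucleotide positions (an assumption of this sketch;
# the text above writes the mixtures out as A/C, A/G, and so on).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "M": "AC", "R": "AG", "W": "AT", "S": "CG", "Y": "CT", "K": "GT",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Standard genetic code laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def expand(degenerate_codon):
    """List every concrete codon represented by a three-letter degenerate codon."""
    return ["".join(p) for p in product(*(IUPAC[s] for s in degenerate_codon))]


def encoded_amino_acids(degenerate_codon):
    """Return the set of amino acids (stop codons excluded) the codon can encode."""
    return {CODON_TABLE[c] for c in expand(degenerate_codon)} - {"*"}


if __name__ == "__main__":
    for codon in ("RAT", "NNK", "NNN"):
        print(codon, len(expand(codon)), sorted(encoded_amino_acids(codon)))
```

Under these assumptions, choosing the mixture at each of the three positions sets how many of the 20 amino acids are sampled at a given position of the encoded polypeptide.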



[000187] One aspect of the invention is an isolated nucleic acid comprising one of the 
sequences of the Group A nucleic acid sequences, sequences substantially identical thereto, 
sequences complementary thereto, or a fragment comprising at least 10, 15, 20, 25, 30, 35, 
40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive bases of one of the sequences of the 
Group A nucleic acid sequences. The isolated nucleic acids may comprise DNA, including 
cDNA, genomic DNA, and synthetic DNA. The DNA may be double-stranded or single- 
stranded, and if single stranded may be the coding strand or non-coding (anti-sense) strand. 
Alternatively, the isolated nucleic acids may comprise RNA. 

[000188] As discussed in more detail below, the isolated nucleic acid sequences of the 
invention may be used to prepare one of the polypeptides of the Group B amino acid 
sequences, and sequences substantially identical thereto, or fragments comprising at least 5, 
10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids of one of the 
polypeptides of the Group B amino acid sequences, and sequences substantially identical 
thereto. 

[000189] Alternatively, the nucleic acid sequences of the invention may be mutagenized 
using conventional techniques, such as site directed mutagenesis, or other techniques familiar 
to those skilled in the art, to introduce silent changes into the polynucleotides of the Group A 
nucleic acid sequences, and sequences substantially identical thereto. As used herein, "silent 
changes" include, for example, changes which do not alter the amino acid sequence encoded 
by the polynucleotide. Such changes may be desirable in order to increase the level of the 
polypeptide produced by host cells containing a vector encoding the polypeptide by 
introducing codons or codon pairs which occur frequently in the host organism. 
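
As an illustration of a silent change, the following sketch replaces codons with synonymous codons and checks that the encoded amino acid sequence is unchanged. It assumes the standard genetic code; the table `PREFERRED` is a hypothetical set of host-preferred codons invented for this example and is not taken from any real codon-usage data.

```python
# Standard genetic code laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Hypothetical preferred codons for an imagined host (illustrative only).
PREFERRED = {"L": "CTG", "R": "CGT", "S": "AGC"}


def translate(dna):
    """Translate an in-frame coding sequence, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def introduce_silent_changes(dna, preferred):
    """Swap each codon for the preferred synonymous codon, where one is listed."""
    for aa, codon in preferred.items():
        # Guard: a listed codon must really encode the amino acid it replaces.
        assert CODON_TABLE[codon] == aa
    recoded = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE.get(codon)  # None for a trailing partial codon
        recoded.append(preferred.get(aa, codon) if aa is not None else codon)
    return "".join(recoded)


if __name__ == "__main__":
    original = "ATGTTAAGATCATAA"
    recoded = introduce_silent_changes(original, PREFERRED)
    print(original, translate(original))
    print(recoded, translate(recoded))
```

The nucleotide sequence changes while the translation stays the same, which is the defining property of a silent change as used above.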

[000190] The invention also relates to polynucleotides which have nucleotide changes 
which result in amino acid substitutions, additions, deletions, fusions and truncations in the 
polypeptides of the invention (e.g., the Group B amino acid sequences). Such nucleotide 
changes may be introduced using techniques such as site-directed mutagenesis, random 
chemical mutagenesis, exonuclease III deletion, and other recombinant DNA techniques. 
Alternatively, such nucleotide changes may be naturally occurring allelic variants which are 
isolated by identifying nucleic acid sequences which specifically hybridize to probes 
comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 
consecutive bases of one of the sequences of the Group A nucleic acid sequences, and 
sequences substantially identical thereto (or the sequences complementary thereto) under 
conditions of high, moderate, or low stringency as provided herein. 

Immobilized Enzyme Solid Supports 

[000191] The enzymes, fragments thereof and nucleic acids which encode the enzymes and 
fragments can be affixed to a solid support. Immobilization often makes the use of the 
enzymes in industrial processes more economical and efficient. For example, a consortium or 
cocktail of enzymes (or active fragments thereof), which are used in a specific chemical 
reaction, can be attached to a solid support and immersed in a process vat, where the 
enzymatic reaction occurs. The solid support can then be taken out of the vat, along with the 
enzymes affixed thereto, for repeated use. In one aspect of the invention, the isolated nucleic 
acid is affixed to a solid support. In another aspect of the invention, the solid support is 
selected from the group of a gel, a resin, a polymer, a ceramic, a glass, a microelectrode and 
any combination thereof. 

[000192] For example, solid supports useful in this invention include gels. Some examples 
of gels include sepharose, gelatin, glutaraldehyde, chitosan-treated glutaraldehyde, 
albumin-glutaraldehyde, chitosan-Xanthan, toyopearl gel (polymer gel), alginate, 
alginate-polylysine, carrageenan, agarose, glyoxyl agarose, magnetic agarose, dextran-agarose, 
poly(carbamoyl sulfonate) hydrogel, BSA-PEG hydrogel, phosphorylated polyvinyl alcohol (PVA), 
monoaminoethyl-N-aminoethyl (MANA), amino, or any combination thereof. 

[000193] Another type of solid support useful in the present invention is a resin or polymer. 
Some examples of resins or polymers include cellulose, acrylamide, nylon, rayon, polyester, 
anion-exchange resin, AMBERLITE™ XAD-7, AMBERLITE™ XAD-8, AMBERLITE™ 
IRA-94, AMBERLITE™ IRC-50, polyvinyl, polyacrylic, polymethacrylate, or any 
combination thereof. Another type of solid support useful in the present invention is ceramic. 
Some examples include non-porous ceramic, porous ceramic, SiO2, Al2O3. Another type of 
solid support useful in the present invention is glass. Some examples include non-porous 
glass, porous glass, aminopropyl glass or any combination thereof. Another type of solid 
support which can be used is a microelectrode. An example is a polyethyleneimine-coated 
magnetite. Graphitic particles can be used as a solid support. Another example of a solid 
support is a cell, such as a red blood cell. 

Methods of immobilization 

[000194] There are many methods which would be known to one of skill in the art for 
immobilizing enzymes or fragments thereof, or nucleic acids, onto a solid support. Some 
examples of such methods include electrostatic droplet generation, electrochemical means, 
via adsorption, via covalent binding, via cross-linking, via a chemical reaction or process, via 
encapsulation, via entrapment, via calcium alginate, or via poly (2-hydroxyethyl 
methacrylate). Like methods are described in Methods in Enzymology, Immobilized Enzymes 
and Cells, Part C. 1987. Academic Press. Edited by S. P. Colowick and N. O. Kaplan. 
Volume 136; and Immobilization of Enzymes and Cells. 1997. Humana Press. Edited by G. 
F. Bickerstaff. Series: Methods in Biotechnology, Edited by J. M. Walker. 

[000195] Probes - The isolated nucleic acids of the Group A nucleic acid sequences, 
sequences substantially identical thereto, complementary sequences, or a fragment 
comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 
consecutive bases of one of the foregoing sequences may also be used as probes to determine 
whether a biological sample, such as a soil sample, contains an organism having a nucleic 
acid sequence of the invention or an organism from which the nucleic acid was obtained. In 
such procedures, a biological sample potentially harboring the organism from which the 
nucleic acid was isolated is obtained and nucleic acids are obtained from the sample. The 
nucleic acids are contacted with the probe under conditions which permit the probe to 
specifically hybridize to any complementary sequences which are present therein. 

[000196] Where necessary, conditions which permit the probe to specifically hybridize to 
complementary sequences may be determined by placing the probe in contact with 
complementary sequences from samples known to contain the complementary sequence as 
well as control sequences which do not contain the complementary sequence. Hybridization 
conditions, such as the salt concentration of the hybridization buffer, the formamide 
concentration of the hybridization buffer, or the hybridization temperature, can be varied to 
identify conditions which allow the probe to hybridize specifically to complementary nucleic 
acids. Stringent hybridization conditions are recited herein. 

[000197] Hybridization may be detected by labeling the probe with a detectable agent such 
as a radioactive isotope, a fluorescent dye or an enzyme capable of catalyzing the formation 
of a detectable product. Many methods for using the labeled probes to detect the presence of 
complementary nucleic acids in a sample are familiar to those skilled in the art. These 
include Southern Blots, Northern Blots, colony hybridization procedures, and dot blots. 
Protocols for each of these procedures are provided in Ausubel et al. (1997), Current 
Protocols in Molecular Biology, John Wiley & Sons, Inc., and Sambrook et al. (1989), 
Molecular Cloning: A Laboratory Manual 2d Ed., Cold Spring Harbor Laboratory Press, the 
entire disclosures of which are incorporated herein by reference. 

[000198] In one example, a probe DNA is "labeled" with one partner of a specific binding 
pair (i.e., a ligand) and the other partner of the pair is bound to a solid matrix to provide ease 
of separation of target from its source. For example, the ligand and specific binding partner 
can be selected from, in either orientation, the following: (1) an antigen or hapten and an 
antibody or specific binding fragment thereof; (2) biotin or iminobiotin and avidin or 
streptavidin; (3) a sugar and a lectin specific therefor; (4) an enzyme and an inhibitor 
therefor; (5) an apoenzyme and cofactor; (6) complementary homopolymeric 
oligonucleotides; and (7) a hormone and a receptor therefor. In one example, the solid phase 
is selected from: (1) a glass or polymeric surface; (2) a packed column of polymeric beads; 
and (3) magnetic or paramagnetic particles. 

[000199] Alternatively, more than one probe (at least one of which is capable of specifically 
hybridizing to any complementary sequences which are present in the nucleic acid sample), 
may be used in an amplification reaction to determine whether the sample contains an 
organism containing a nucleic acid sequence of the invention (e.g., an organism from which 
the nucleic acid was isolated). Typically, the probes comprise oligonucleotides. In one 
aspect, the amplification reaction may comprise a PCR reaction. PCR protocols are 
described in Ausubel et al. (1997), supra, and Sambrook et al. (1989), supra. Alternatively, 
the amplification may comprise a ligase chain reaction, 3SR, or strand displacement reaction. 
(See Barany (1991), PCR Methods and Applications 1:5-16; Fahy et al. (1991), PCR Methods 
and Applications 1:25-33; and Walker et al. (1992), Nucleic Acids Research 20:1691-1696, 
the disclosures of which are incorporated herein by reference in their entireties). 

[000200] Probes derived from sequences near the ends of a sequence as set forth in the 
Group A nucleic acid sequences, and sequences substantially identical thereto, may also be 
used in chromosome walking procedures to identify clones containing genomic sequences 
located adjacent to the nucleic acid sequences as set forth above. Such methods allow the 
isolation of genes which encode additional proteins from the host organism. 

[000201] An isolated nucleic acid sequence as set forth in the Group A nucleic acid 
sequences, sequences substantially identical thereto, sequences complementary thereto, or a 
fragment comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 
500 consecutive bases of one of the foregoing sequences may be used as probes to identify 
and isolate related nucleic acids. In some aspects, the related nucleic acids may be cDNAs or 
genomic DNAs from organisms other than the one from which the nucleic acid was isolated. 
For example, the other organisms may be related organisms. In such procedures, a nucleic 
acid sample is contacted with the probe under conditions which permit the probe to 
specifically hybridize to related sequences. Hybridization of the probe to nucleic acids from 
the related organism is then detected using any of the methods described above. 

[000202] In nucleic acid hybridization reactions, the conditions used to achieve a particular 
level of stringency will vary, depending on the nature of the nucleic acids being hybridized. 
For example, the length of the nucleic acids, the amount of complementarity between the 
nucleic acids, the nucleotide sequence composition (e.g., G-C rich v. A-T rich content), and 
the nucleic acid type (e.g., RNA v. DNA) can be considered in selecting hybridization 
conditions. Stringency may be varied by conducting the hybridization at varying 
temperatures below the melting temperatures of the probes. The melting temperature, Tm, is 
the temperature (under defined ionic strength and pH) at which 50% of the target sequence 
hybridizes to a perfectly complementary probe. Stringent conditions are selected to be equal 
to or about 5°C lower than the Tm for a particular probe. The melting temperature of the 
probe may be calculated using the following formulas: 

[000203] For probes between 14 and 70 nucleotides in length the melting temperature (Tm) 
is calculated using the formula: Tm = 81.5 + 16.6(log [Na+]) + 0.41(fraction G+C) - (600/N) 
where N is the length of the probe. 

[000204] If the hybridization is carried out in a solution containing formamide, the melting 
temperature may be calculated using the equation: Tm = 81.5 + 16.6(log [Na+]) + 0.41(fraction 
G+C) - (0.63% formamide) - (600/N) where N is the length of the probe. 
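
The two melting temperature formulas can be evaluated with a short sketch. Several readings are assumptions of the sketch rather than statements of the text: the logarithm is taken as base 10, [Na+] is given in moles per liter, the G+C term is supplied as a percentage (the text writes "fraction G+C"; the commonly cited form of this formula uses percent G+C), and formamide is given as a percentage. The function names are hypothetical.

```python
import math


def melting_temperature(na_molar, gc_percent, length, formamide_percent=0.0):
    """Estimate probe Tm in degrees C from the formula in the text.

    na_molar          -- sodium ion concentration (assumed mol/L)
    gc_percent        -- G+C content of the probe (assumed to be a percentage)
    length            -- probe length N in nucleotides
    formamide_percent -- formamide concentration (assumed percent; 0 if absent)
    """
    if length <= 0 or na_molar <= 0:
        raise ValueError("length and sodium concentration must be positive")
    return (
        81.5
        + 16.6 * math.log10(na_molar)
        + 0.41 * gc_percent
        - 0.63 * formamide_percent
        - 600.0 / length
    )


def stringent_hybridization_temperature(tm, offset=5.0):
    """Temperature about `offset` degrees below Tm, per the text's 'about 5°C lower'."""
    return tm - offset


if __name__ == "__main__":
    tm = melting_temperature(na_molar=1.0, gc_percent=50.0, length=20)
    print(round(tm, 1), round(stringent_hybridization_temperature(tm), 1))
```

With the formamide term left at zero the function reduces to the first formula; supplying a nonzero formamide percentage gives the second.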

[000205] Expression Libraries - Expression libraries can be created using the 
polynucleotides of the invention in combination with expression vectors and appropriate host 
cells. The library allows for the in vivo expression of the polypeptides which are encoded by 
the polynucleotides of the invention. After such expression libraries have been generated one 
can include the additional step of "biopanning" such libraries prior to screening by cell 
sorting. The "biopanning" procedure refers to a process for identifying clones having a 
specified biological activity by screening for sequence identity in a library of clones prepared 
by (i) selectively isolating target DNA derived from at least one microorganism by use of at 
least one probe DNA comprising at least a portion of a DNA sequence encoding a 
polypeptide having a specified biological activity (e.g., nitrilase activity); and (ii) optionally 
transforming a host with the isolated target DNA to produce a library of clones which are 
screened for the specified biological activity. 

[000206] The probe DNA used for selectively isolating the target DNA of interest from the 
DNA derived from at least one microorganism can be a full-length coding region sequence or 
a partial coding region sequence of DNA for an enzyme of known activity. The original 
DNA library can be probed using mixtures of probes comprising at least a portion of DNA 
sequences encoding enzymes having the specified enzyme activity. These probes or probe 
libraries are single-stranded and the microbial DNA which is probed has been converted into 
single-stranded form. The probes that are particularly suitable are those derived from DNA 
encoding enzymes having an activity similar or identical to the specified enzyme activity that 
is to be screened. 

[000207] Having prepared a multiplicity of clones from DNA selectively isolated from an 
organism, such clones are screened for a specific enzyme activity to identify the clones 
having the specified enzyme characteristics. 

[000208] The screening for enzyme activity may be effected on individual expression 
clones or may be initially effected on a mixture of expression clones to ascertain whether or 
not the mixture has one or more specified enzyme activities. If the mixture has a specified 
enzyme activity, then the individual clones may be rescreened for such enzyme activity or for 
a more specific activity. Thus, for example, if a clone mixture has nitrilase activity, then the 
individual clones may be recovered and screened to determine which of such clones has 
nitrilase activity. 
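
The two-stage screen described above (assay a mixture first, then rescreen individual clones only from mixtures that show activity) can be sketched as follows. The function `screen_in_pools` and the `assay` callback are hypothetical stand-ins for a laboratory assay; the pool size and clone names are arbitrary.

```python
def screen_in_pools(clones, pool_size, assay):
    """Screen clones as mixtures, then rescreen members of active mixtures.

    clones    -- list of clone identifiers
    pool_size -- number of clones combined into each mixture
    assay     -- callable taking a list of clones and returning True if the
                 mixture shows the specified activity (hypothetical stand-in)
    Returns (active_clones, number_of_assays_run).
    """
    if pool_size < 1:
        raise ValueError("pool_size must be at least 1")
    active, assays_run = [], 0
    for start in range(0, len(clones), pool_size):
        pool = clones[start:start + pool_size]
        assays_run += 1
        if not assay(pool):
            continue  # no activity in the mixture: skip its members
        for clone in pool:
            assays_run += 1
            if assay([clone]):
                active.append(clone)
    return active, assays_run


if __name__ == "__main__":
    library = [f"clone{i}" for i in range(1, 13)]
    truly_active = {"clone3", "clone7"}

    def simulated_assay(mixture):
        # A mixture is scored positive if any member carries the activity.
        return any(c in truly_active for c in mixture)

    hits, n = screen_in_pools(library, pool_size=4, assay=simulated_assay)
    print(hits, n)
```

When active clones are rare, screening mixtures first can reduce the number of assays compared with testing every clone individually; the saving depends on the pool size and on how the active clones are distributed.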

[000209] As described with respect to one of the above aspects, the invention provides a 
process for enzyme activity screening of clones containing selected DNA derived from a 
microorganism, which process includes: screening a library for specified enzyme activity, 
said library including a plurality of clones, said clones having been prepared by recovering 
from genomic DNA of a microorganism selected DNA, which DNA is selected by 
hybridization to at least one DNA sequence which is all or a portion of a DNA sequence 
encoding an enzyme having the specified activity; and transforming a host with the selected 
DNA to produce clones which are screened for the specified enzyme activity. 

[000210] In one aspect, a DNA library derived from a microorganism is subjected to a 
selection procedure to select therefrom DNA which hybridizes to one or more probe DNA 
sequences, each of which is all or a portion of a DNA sequence encoding an enzyme having the 
specified enzyme activity, by: 

(a) contacting the single-stranded DNA population from the DNA library with the 
DNA probe bound to a ligand under stringent hybridization conditions so as to produce a 
duplex between the probe and a member of the DNA library; 

(b) contacting the duplex with a solid phase specific binding partner for the ligand so 
as to produce a solid phase complex; 

(c) separating the solid phase complex from the non-duplexed members of the DNA 
library; 

(d) denaturing the duplex to release the member of the DNA library; 

(e) creating a complementary DNA strand of the member from step (d) so as to make the 
member a double-stranded DNA; 

(f) introducing the double-stranded DNA into a suitable host so as to express a 
polypeptide which is encoded by the member DNA; and 

(g) determining whether the polypeptide expressed exhibits the specified enzymatic 
activity. 

[000211] In another aspect, the process includes a preselection to recover DNA including 
signal or secretion sequences. In this manner it is possible to select from the genomic DNA 
population by hybridization as hereinabove described only DNA which includes a signal or 
secretion sequence. The following paragraphs describe the protocol for this aspect of the 
invention, the nature and function of secretion signal sequences in general and a specific 
exemplary application of such sequences to an assay or selection process. 

[000212] A particular embodiment of this aspect further comprises, after (a) but before (b) 
above, the steps of: 

(i) contacting the single-stranded DNA population of (a) with a ligand-bound 
oligonucleotide probe that is complementary to a secretion signal sequence unique to a given 
class of proteins under hybridization conditions to form a double-stranded DNA duplex; 

(ii) contacting the duplex of (i) with a solid phase specific binding partner for said 
ligand so as to produce a solid phase complex; 

(iii) separating the solid phase complex from the single-stranded DNA population of 
(a); 

(iv) denaturing the duplex so as to release single-stranded DNA members of the 
genomic population; and 

(v) separating the single-stranded DNA members from the solid phase bound probe. 

[000213] The DNA which has been selected and isolated to include a signal sequence is 
then subjected to the selection procedure hereinabove described to select and isolate 
therefrom DNA which binds to one or more probe DNA sequences derived from DNA 
encoding an enzyme(s) having the specified enzyme activity. This procedure is described 
and exemplified in U.S. Pat. No. 6,054,267, incorporated herein by reference in its entirety. 

[000214] In vivo biopanning may be performed utilizing a fluorescence activated cell 
sorter (FACS)-based machine. Complex gene libraries are constructed with vectors which 
contain elements which stabilize transcribed RNA. For example, the inclusion of sequences 
which result in secondary structures such as hairpins which are designed to flank the 
transcribed regions of the RNA would serve to enhance their stability, thus increasing their 
half life within the cell. The probe molecules used in the biopanning process consist of 
oligonucleotides labeled with reporter molecules that only fluoresce upon binding of the 
probe to a target molecule. These probes are introduced into the recombinant cells from the 
library using one of several transformation methods. The probe molecules bind to the 
transcribed target mRNA resulting in DNA/RNA heteroduplex molecules. Binding of the 
probe to a target will yield a fluorescent signal that is detected and sorted by the FACS 
machine during the screening process. 

[000215] In some aspects, the nucleic acid encoding one of the polypeptides of the Group B 
amino acid sequences, sequences substantially identical thereto, or fragments comprising at 
least about 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids thereof 
is assembled in appropriate phase with a leader sequence capable of directing secretion of the 
translated polypeptide or fragment thereof. Optionally, the nucleic acid can encode a fusion 
polypeptide in which one of the polypeptides of the Group B amino acid sequences, 
sequences substantially identical thereto, or fragments comprising at least 5, 10, 15, 20, 25, 
30, 35, 40, 50, 75, 100, or 150 consecutive amino acids thereof is fused to heterologous 
peptides or polypeptides, such as N-terminal identification peptides which impart desired 
characteristics, such as increased stability or simplified purification. 

[000216] The host cell may be any of the host cells familiar to those skilled in the art, 
including prokaryotic cells, eukaryotic cells, mammalian cells, insect cells, or plant cells. As 
representative examples of appropriate hosts, there may be mentioned: bacterial cells, such as 
E. coli, Streptomyces, Bacillus subtilis, Salmonella typhimurium and various species within 
the genera Pseudomonas, Streptomyces, and Staphylococcus; fungal cells, such as yeast; 
insect cells, such as Drosophila S2 and Spodoptera Sf9; animal cells, such as CHO, COS or 
Bowes melanoma; and adenoviruses. The selection of an appropriate host is within the 
abilities of those skilled in the art. 

[000217] Where appropriate, the engineered host cells can be cultured in conventional 
nutrient media modified as appropriate for activating promoters, selecting transformants or 
amplifying the genes of the invention. Following transformation of a suitable host strain and 
growth of the host strain to an appropriate cell density, the selected promoter may be induced 
by appropriate means (e.g., temperature shift or chemical induction) and the cells may be 
cultured for an additional period to allow them to produce the desired polypeptide or 
fragment thereof. 

[000218] Cells are typically harvested by centrifugation, disrupted by physical or chemical 
means, and the resulting crude extract is retained for further purification. Microbial cells 
employed for expression of proteins can be disrupted by any convenient method, including 
freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents. Such 
methods are well known to those skilled in the art. The expressed polypeptide or fragment 
thereof can be recovered and purified from recombinant cell cultures by methods including 
ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange 
chromatography, phosphocellulose chromatography, hydrophobic interaction 
chromatography, affinity chromatography, hydroxylapatite chromatography and lectin 
chromatography. Protein refolding steps can be used, as necessary, in completing 
configuration of the polypeptide. If desired, high performance liquid chromatography 
(HPLC) can be employed for final purification steps. 

[000219] Various mammalian cell culture systems can also be employed to express 
recombinant protein. Examples of mammalian expression systems include the COS-7 lines 
of monkey kidney fibroblasts (described by Gluzman (1981), Cell 23:175), and other cell 
lines capable of expressing proteins from a compatible vector, such as the C127, 3T3, CHO, 
HeLa and BHK cell lines. 

[000220] The invention also relates to variants of the polypeptides of the Group B amino 
acid sequences, sequences substantially identical thereto, or fragments comprising at least 5, 
10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids thereof. In particular, 
the variants may differ in amino acid sequence from the polypeptides of the Group B amino 
acid sequences, and sequences substantially identical thereto, by one or more substitutions, 
additions, deletions, fusions and truncations, which may be present in any combination. 

[000221] The variants may be naturally occurring or created in vitro. In particular, such 
variants may be created using genetic engineering techniques such as site directed 
mutagenesis, random chemical mutagenesis, Exonuclease III deletion procedures, and 
standard cloning techniques. Alternatively, such variants, fragments, analogs, or derivatives 
may be created using chemical synthesis or modification procedures. 

[000222] Other methods of making variants are also familiar to those skilled in the art. 
These include procedures in which nucleic acid sequences obtained from natural isolates are 
modified to generate nucleic acids which encode polypeptides having characteristics which 
enhance their value in industrial or laboratory applications. In such procedures, a large 
number of variant sequences having one or more nucleotide differences with respect to the 
sequence obtained from the natural isolate are generated and characterized. Typically, these 
nucleotide differences result in amino acid changes with respect to the polypeptides encoded 
by the nucleic acids from the natural isolates. 
Error Prone PCR 

[000223] For example, variants may be created using error prone PCR. In error prone PCR, 
PCR is performed under conditions where the copying fidelity of the DNA polymerase is 
low, such that a high rate of point mutations is obtained along the entire length of the PCR 
product. Error prone PCR is described in Leung et al. (1989), Technique 1:11-15 and 
Caldwell et al. (1992), PCR Methods Applic. 2:28-33, the disclosures of which are 
incorporated herein by reference in their entirety. Briefly, in such procedures, nucleic acids 
to be mutagenized are mixed with PCR primers and reagents (e.g., reaction buffer, MgCl2, 
MnCl2, Taq polymerase and an appropriate concentration of dNTPs) for achieving a high rate 
of point mutation along the entire length of the PCR product. For example, the reaction may 
be performed using 20 fmoles of nucleic acid to be mutagenized, 30 pmoles of each PCR 
primer, a reaction buffer comprising 50 mM KCl, 10 mM Tris HCl (pH 8.3) and 0.01% 
gelatin, 7 mM MgCl2, 0.5 mM MnCl2, 5 units of Taq polymerase, 0.2 mM dGTP, 0.2 mM 
dATP, 1 mM dCTP, and 1 mM dTTP. PCR may be performed for 30 cycles of 94°C for 1 
min, 45°C for 1 min, and 72°C for 1 min. However, it will be appreciated that these 
parameters may be varied as appropriate. The mutagenized nucleic acids are cloned into an 
appropriate vector and the activities of the polypeptides encoded by the mutagenized nucleic 
acids are evaluated. 
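
The effect of low copying fidelity can be pictured with a toy in-silico model. The sketch below is not a model of the reaction chemistry described above: it follows a single template lineage through repeated copying, and the per-base error rate, the uniform choice of substituted base, and the function names are all assumptions made for illustration.

```python
import random


def error_prone_copy(template, error_rate, rng):
    """Copy a sequence, substituting each base with probability `error_rate`."""
    copied = []
    for base in template:
        if rng.random() < error_rate:
            # Substitute one of the three other bases, chosen uniformly.
            copied.append(rng.choice([b for b in "ACGT" if b != base]))
        else:
            copied.append(base)
    return "".join(copied)


def simulate_error_prone_pcr(template, error_rate, cycles, seed=0):
    """Follow one lineage through `cycles` rounds of low-fidelity copying."""
    rng = random.Random(seed)  # seeded so that a run can be repeated
    product = template
    for _ in range(cycles):
        product = error_prone_copy(product, error_rate, rng)
    return product


def count_differences(a, b):
    """Number of positions at which two equal-length sequences differ."""
    return sum(1 for x, y in zip(a, b) if x != y)


if __name__ == "__main__":
    gene = "ATGGCTAGCAAAGGAGAAGAACTTTTCACTGGAGTTGTC"
    variant = simulate_error_prone_pcr(gene, error_rate=0.002, cycles=30)
    print(variant, count_differences(gene, variant))
```

In this simplified picture, point substitutions are scattered along the whole length of the product, and the number accumulated grows with both the per-cycle error rate and the number of cycles.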

[000224] Variants also may be created using oligonucleotide directed mutagenesis to 
generate site-specific mutations in any cloned DNA of interest. Oligonucleotide mutagenesis 
is described in Reidhaar-Olson et al. (1988), Science, 241:53-57, the disclosure of which is 
incorporated herein by reference in its entirety. Briefly, in such procedures a plurality of 
double stranded oligonucleotides bearing one or more mutations to be introduced into the 
cloned DNA are synthesized and inserted into the cloned DNA to be mutagenized. Clones 
containing the mutagenized DNA are recovered and the activities of the polypeptides they 
encode are assessed. 
Assembly PCR 

[000225] Another method for generating variants is assembly PCR. Assembly PCR 
involves the assembly of a PCR product from a mixture of small DNA fragments. A large 
number of different PCR reactions occur in parallel in the same vial, with the products of one 
reaction priming the products of another reaction. Assembly PCR is described in U.S. Pat. 
No. 5,965,408, the disclosure of which is incorporated herein by reference in its entirety. 
Sexual PCR mutagenesis 

[000226] Still another method of generating variants is sexual PCR mutagenesis. In sexual 
PCR mutagenesis, forced homologous recombination occurs between DNA molecules of 
different but highly related DNA sequence in vitro, as a result of random fragmentation of the 
DNA molecule based on sequence homology, followed by fixation of the crossover by primer 
extension in a PCR reaction. Sexual PCR mutagenesis is described in Stemmer (1994), Proc. 
Natl. Acad. Sci. USA 91:10747-10751, the disclosure of which is incorporated herein by 
reference in its entirety. Briefly, in such procedures a plurality of nucleic acids to be 
recombined are digested with DNAse to generate fragments having an average size of about 
50-200 nucleotides. Fragments of the desired average size are purified and resuspended in a 
PCR mixture. PCR is conducted under conditions which facilitate recombination between 
the nucleic acid fragments. For example, PCR may be performed by resuspending the 
purified fragments at a concentration of 10-30 ng/µl in a solution of 0.2 mM of each dNTP, 
2.2 mM MgCl2, 50 mM KCl, 10 mM Tris HCl, pH 9.0, and 0.1% Triton X-100. 2.5 units of 
Taq polymerase per 100 µl of reaction mixture is added and PCR is performed using the 
following regime: 94°C for 60 seconds, 94°C for 30 seconds, 50-55°C for 30 seconds, 72°C 
for 30 seconds (30-45 times) and 72°C for 5 minutes. However, it will be appreciated that 
these parameters may be varied as appropriate. In some aspects, oligonucleotides may be 
included in the PCR reactions. In other aspects, the Klenow fragment of DNA polymerase I 
may be used in a first set of PCR reactions and Taq polymerase may be used in a subsequent 
set of PCR reactions. Recombinant sequences are isolated and the activities of the 
polypeptides they encode are assessed. 
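
As a rough planning aid, the cycling regime quoted above can be summed to give the programmed hold time of a run. The following is a minimal sketch only: the function name is hypothetical, ramp times between temperatures are ignored, and the cycle count is left as a parameter because the text gives a range of 30-45 cycles.

```python
def cycling_time_seconds(cycles: int) -> int:
    """Programmed hold time for the regime in the text (ramp times ignored)."""
    initial_denaturation = 60      # 94°C for 60 seconds
    per_cycle = 30 + 30 + 30       # 94°C, 50-55°C and 72°C, 30 seconds each
    final_extension = 5 * 60       # 72°C for 5 minutes
    return initial_denaturation + cycles * per_cycle + final_extension


if __name__ == "__main__":
    for n in (30, 45):
        print(n, "cycles:", cycling_time_seconds(n) / 60, "minutes")
```

Under these assumptions the regime takes between 51 and 73.5 minutes of hold time, depending on the number of cycles chosen.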
In vivo Mutagenesis 

[000227] Variants may also be created by in vivo mutagenesis. In some aspects, random 
mutations in a sequence of interest are generated by propagating the sequence of interest in a 
bacterial strain, such as an E. coli strain, which carries mutations in one or more of the DNA 
repair pathways. Such "mutator" strains have a higher random mutation rate than that of a 
wild-type parent. Propagating the DNA in one of these strains will eventually generate 
random mutations within the DNA. Mutator strains suitable for use for in vivo mutagenesis 
are described in PCT Publication No. WO 91/16427, the disclosure of which is incorporated 
herein by reference in its entirety. 
Cassette Mutagenesis 

[000228] Variants may also be generated using cassette mutagenesis. In cassette 
mutagenesis a small region of a double stranded DNA molecule is replaced with a synthetic 
oligonucleotide "cassette" that differs from the native sequence. The oligonucleotide often 
contains completely and/or partially randomized native sequence. 
Recursive Ensemble Mutagenesis 

[000229] Recursive ensemble mutagenesis may also be used to generate variants. 
Recursive ensemble mutagenesis is an algorithm for protein engineering (protein 
mutagenesis) developed to produce diverse populations of phenotypically related mutants 
whose members differ in amino acid sequence. This method uses a feedback mechanism to 
control successive rounds of combinatorial cassette mutagenesis. Recursive ensemble 
mutagenesis is described in Arkin et al. (1992), Proc. Natl. Acad. Sci. USA, 89:7811-7815, 
the disclosure of which is incorporated herein by reference in its entirety. 

Exponential Ensemble Mutagenesis 

[000230] In some aspects, variants are created using exponential ensemble mutagenesis. 
Exponential ensemble mutagenesis is a process for generating combinatorial libraries with a 
high percentage of unique and functional mutants, wherein small groups of residues are 
randomized in parallel to identify, at each altered position, amino acids which lead to 
functional proteins. Exponential ensemble mutagenesis is described in Delegrave et al. 
(1993), Biotechnology Research 11:1548-1552, the disclosure of which is incorporated herein 
by reference in its entirety. 
Random and site-directed mutagenesis 

[000231] Random and site-directed mutagenesis is described in Arnold (1993), Current 
Opinions in Biotechnology 4:450-455, the disclosure of which is incorporated herein by 
reference in its entirety. 
Shuffling Procedures 

[000232] In some aspects, the variants are created using shuffling procedures wherein 
portions of a plurality of nucleic acids which encode distinct polypeptides are fused together 
to create chimeric nucleic acid sequences which encode chimeric polypeptides as described in 
U.S. Patent Nos. 5,965,408 and 5,939,250, each of which is hereby incorporated by reference 
in its entirety. 

[000233] The variants of the polypeptides of the Group B amino acid sequences may be 
variants in which one or more of the amino acid residues of the polypeptides of the Group B 
amino acid sequences are substituted with a conserved or non-conserved amino acid residue 
(e.g., a conserved amino acid residue) and such substituted amino acid residue may or may 
not be one encoded by the genetic code. 

[000234] Conservative substitutions are those that substitute a given amino acid in a 
polypeptide by another amino acid of like characteristics. Typically seen as conservative 
substitutions are the following replacements: replacements of an aliphatic amino acid such as 
Alanine, Valine, Leucine and Isoleucine with another aliphatic amino acid; replacement of a 
Serine with a Threonine or vice versa; replacement of an acidic residue such as Aspartic acid 
and Glutamic acid with another acidic residue; replacement of a residue bearing an amide 
group, such as Asparagine and Glutamine, with another residue bearing an amide group; 
exchange of a basic residue such as Lysine and Arginine with another basic residue; and 
replacement of an aromatic residue such as Phenylalanine and Tyrosine with another aromatic 
residue. 
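
The replacement classes listed above can be written as a simple lookup. The sketch below is illustrative only: the group table contains just the residues named in the text (one-letter codes), the function name is hypothetical, and other scoring schemes may treat further exchanges as conservative.

```python
# Residue groups exactly as named in the paragraph above (one-letter codes).
CONSERVATIVE_GROUPS = [
    set("AVLI"),  # aliphatic: Alanine, Valine, Leucine, Isoleucine
    set("ST"),    # Serine / Threonine
    set("DE"),    # acidic: Aspartic acid, Glutamic acid
    set("NQ"),    # amide-bearing: Asparagine, Glutamine
    set("KR"),    # basic: Lysine, Arginine
    set("FY"),    # aromatic: Phenylalanine, Tyrosine
]


def is_conservative(original: str, replacement: str) -> bool:
    """Return True if both residues fall in one of the groups listed above."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return False  # identical residues are not a substitution
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


if __name__ == "__main__":
    print(is_conservative("L", "I"))  # aliphatic for aliphatic
    print(is_conservative("D", "K"))  # acidic for basic
```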

[000235] Other variants are those in which one or more of the amino acid residues of the 
polypeptides of the Group B amino acid sequences includes a substituent group. 
[000236] Still other variants are those in which the polypeptide is associated with another 
compound, such as a compound to increase the half-life of the polypeptide (for example, 
polyethylene glycol). 

[000237] Additional variants are those in which additional amino acids are fused to the 
polypeptide, such as a leader sequence, a secretory sequence, a proprotein sequence or a 
sequence which facilitates purification, enrichment, or stabilization of the polypeptide. 
[000238] In some aspects, the fragments, derivatives and analogs retain the same biological 
function or activity as the polypeptides of the Group B amino acid sequences, and sequences 
substantially identical thereto. In other aspects, the fragment, derivative, or analog includes a 
proprotein, such that the fragment, derivative, or analog can be activated by cleavage of the 
proprotein portion to produce an active polypeptide. 

[000239] Another aspect of the invention is polypeptides or fragments thereof which have at 
least about 85%, at least about 90%, at least about 95%, or more than about 95% homology to 
one of the polypeptides of the Group B amino acid sequences, sequences substantially 
identical thereto, or a fragment comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 
or 150 consecutive amino acids thereof. Percent identity may be determined using any of the 
programs described above which align the polypeptides or fragments being compared and 
determine the extent of amino acid homology or similarity between them. It will be 
appreciated that amino acid "homology" includes conservative amino acid substitutions such 
as those described above. In one aspect of the invention, the fragments can be used to 
generate antibodies. These antibodies can be used to immobilize nitrilases, which can then be 
used in industrial processes. Polynucleotides encoding the nitrilases of the present invention 
can be used in a similar way. 

[000240] Alternatively, the homologous polypeptides or fragments may be obtained through 
biochemical enrichment or purification procedures. The sequence of potentially homologous 
polypeptides or fragments may be determined by proteolytic digestion, gel electrophoresis 
and/or microsequencing. The sequence of the prospective homologous polypeptide or 
fragment can be compared to one of the polypeptides of the Group B amino acid sequences, 
sequences substantially identical thereto, or a fragment comprising at least about 5, 10, 15, 
20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids thereof using any of the 
programs described herein. 

[000241] Another aspect of the invention is an assay for identifying fragments or variants of 
the Group B amino acid sequences, or sequences substantially identical thereto, which retain 
the enzymatic function of the polypeptides of the Group B amino acid sequences, and 
sequences substantially identical thereto. For example, the fragments or variants of the 
polypeptides may be used to catalyze biochemical reactions, which indicates that said 
fragment or variant retains the enzymatic activity of the polypeptides in Group B amino acid 
sequences. 
[000242] The assay for determining if fragments or variants retain the enzymatic activity of 
the polypeptides of the Group B amino acid sequences, and sequences substantially identical 
thereto includes the steps of: contacting the polypeptide fragment or variant with a substrate 
molecule under conditions which allow the polypeptide fragment or variant to function, and 
detecting either a decrease in the level of substrate or an increase in the level of the specific 
reaction product of the reaction between the polypeptide and substrate. 
[000243] The polypeptides of the Group B amino acid sequences, sequences substantially 
identical thereto or fragments comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 
150 consecutive amino acids thereof may be used in a variety of applications. For example, 
the polypeptides or fragments thereof may be used to catalyze biochemical reactions. In 
accordance with one aspect of the invention, there is provided a process for utilizing a 
polypeptide of the Group B amino acid sequences, and sequences substantially identical 
thereto or polynucleotides encoding such polypeptides for hydrolyzing aminonitriles. In such 
procedures, a substance containing an aminonitrile compound is contacted with one of the 
polypeptides of the Group B amino acid sequences, and sequences substantially identical 
thereto under conditions which facilitate the hydrolysis of the compound. 
[000244] Antibodies - The polypeptides of Group B amino acid sequences, sequences 
substantially identical thereto or fragments comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 
50, 75, 100, or 150 consecutive amino acids thereof, may also be used to generate antibodies 
which bind specifically to the enzyme polypeptides or fragments. The resulting antibodies 
may be used in immunoaffinity chromatography procedures to isolate or purify the 
polypeptide or to determine whether the polypeptide is present in a biological sample. In 
such procedures, a protein preparation, such as an extract, or a biological sample is contacted 
with an antibody capable of specifically binding to one of the polypeptides of the Group B 
amino acid sequences, sequences substantially identical thereto, or fragments of the foregoing 
sequences. 

[000245] In immunoaffinity procedures, the antibody is attached to a solid support, such as 
a bead or column matrix. The protein preparation is placed in contact with the antibody 
under conditions under which the antibody specifically binds to one of the polypeptides of the 
Group B amino acid sequences, sequences substantially identical thereto, or fragments 
thereof. After a wash to remove non-specifically bound proteins, the specifically bound 
polypeptides are eluted. 
[000246] The ability of proteins in a biological sample to bind to the antibody may be 
determined using any of a variety of procedures familiar to those skilled in the art. For 
example, binding may be determined by labeling the antibody with a detectable label such as 
a fluorescent agent, an enzymatic label, or a radioisotope. Alternatively, binding of the 
antibody to the sample may be detected using a secondary antibody having such a detectable 
label thereon. Particular assays include ELISA assays, sandwich assays, radioimmunoassays, 
and Western Blots. 

[000247] The antibodies of the invention can be attached to solid supports and used to 
immobilize nitrilases of the present invention. Such immobilized nitrilases can be used, as 
described above, in industrial chemical processes for the conversion of nitriles to a wide 
range of useful products and intermediates. 

[000248] Polyclonal antibodies generated against the polypeptides of the Group B amino 
acid sequences, and sequences substantially identical thereto, or fragments comprising at 
least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids thereof can be 
obtained by direct injection of the polypeptides into an animal or by administering the 
polypeptides to an animal. The antibody so obtained will then bind the polypeptide itself. In 
this manner, even a sequence encoding only a fragment of the polypeptide can be used to 
generate antibodies which may bind to the whole native polypeptide. Such antibodies can 
then be used to isolate the polypeptide from cells expressing that polypeptide. 

[000249] For preparation of monoclonal antibodies, any technique which provides 
antibodies produced by continuous cell line cultures can be used. Examples include the 
hybridoma technique (Kohler and Milstein (1975), Nature, 256:495-497, the disclosure of 
which is incorporated herein by reference), the trioma technique, the human B-cell 
hybridoma technique (Kozbor et al. (1983), Immunology Today 4:72, the disclosure of which 
is incorporated herein by reference), and the EBV-hybridoma technique (Cole et al. (1985), in 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96, the disclosure of 
which is incorporated herein by reference in its entirety). 

[000250] Techniques described for the production of single chain antibodies (U.S. Pat. No. 
4,946,778, the disclosure of which is incorporated herein by reference in its entirety) can be 
adapted to produce single chain antibodies to the polypeptides of, for example, the Group B 
amino acid sequences, or fragments thereof. Alternatively, transgenic mice may be used to 
express humanized antibodies to these polypeptides or fragments. 
[000251] Antibodies generated against a polypeptide of the Group B amino acid sequences, 
sequences substantially identical thereto, or fragments comprising at least 5, 10, 15, 20, 25, 
30, 35, 40, 50, 75, 100, or 150 consecutive amino acids thereof may be used in screening for 
similar polypeptides from other organisms and samples. In such techniques, polypeptides 
from the organism are contacted with the antibody and those polypeptides which specifically 
bind to the antibody are detected. Any of the procedures described above may be used to 
detect antibody binding. One such screening assay is described in "Methods for Measuring 
Cellulase Activities", Methods in Enzymology, 160:87-116, which is hereby incorporated by 
reference in its entirety. 

Use of Whole Cells Comprising A Nucleic Acid 
[000252] The invention provides for the use of whole cells which have been transformed 
with nucleic acid (or an active fragment thereof) encoding one or more of the nitrilases of the 
invention. The invention also provides for the use of such a whole cell in performing a 
nitrilase reaction on a substrate. Therefore, this invention provides for methods of 
hydrolyzing a cyanohydrin or aminonitrile linkage using a whole cell comprising at least one 
nucleic acid or polypeptide disclosed herein (SEQ ID NOS:1-386). For example, a whole 
cell which is stably transfected (the invention also encompasses transiently transfected or 
transformed whole cells) with a nucleic acid encoding a nitrilase is one aspect of the 
invention. Such a cell is useful as a reagent in a reaction mixture to act on a substrate and 
exhibit nitrilase activity. 

Sequence Analysis Software 
[000253] Percent identity or homology between two or more sequences is typically 
measured using sequence analysis software (e.g., Sequence Analysis Software Package of the 
Genetics Computer Group, University of Wisconsin Biotechnology Center, Madison, WI). 
Such software matches similar sequences by assigning a percent identity or homology to 
various deletions, substitutions and other modifications. The term "percent identity," in the 
context of two or more nucleic acids or polypeptide sequences, refers to the percentage of 
nucleotides or amino acid residues that are the same when compared after being aligned for 
maximum correspondence over a designated region or comparison "window." Under some 
algorithms, a conservative amino acid substitution can be considered "identical" and a change 
at a wobble site of a codon can be considered "identical." 
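
The definition of "percent identity" above can be illustrated on two sequences that have already been aligned. This is a minimal sketch, not the calculation performed by any particular software package: the function name is hypothetical, gaps are written as "-", and the choice to count a gapped column as a non-identical position is an assumption, since programs differ on this point.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns carrying the same residue in both sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" and y == "-":
            continue  # a column that is a gap in both sequences carries no information
        compared += 1
        if x == y:
            matches += 1  # x == y cannot be a gap here, so this is a true identity
    if compared == 0:
        raise ValueError("no comparable positions in the window")
    return 100.0 * matches / compared


if __name__ == "__main__":
    print(percent_identity("ACGT-A", "ACCTTA"))
```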
[000254] "Alignment" refers to the process of lining up two or more sequences to achieve 
maximal correspondence for the purpose of assessing the degree of identity or homology, as 
defined within the context of the relevant alignment algorithm. 

[000255] For sequence comparison, typically one sequence acts as a reference sequence, to 
which test sequences are compared. When using a sequence comparison algorithm, test and 
reference sequences are entered into a computer, subsequence coordinates are designated, if 
necessary, and sequence algorithm program parameters are designated for a particular 
algorithm. Default program parameters can be used, or alternative parameters can be 
designated. The sequence comparison algorithm then calculates the percent identity or 
homology for the test sequences relative to the reference sequence, based on the program 
parameters. 

[000256] A "comparison window", as used herein, is a segment of the contiguous positions 
in a nucleic acid or an amino acid sequence consisting of from 20 to 600, usually about 50 to 
about 200, more usually about 100 to about 150 nucleotides or residues, which may be 
compared to a reference sequence of the same or different number of contiguous positions 
after the two sequences are optimally aligned. Methods of alignment of sequences for 
comparison are well-known in the art. Optimal alignment of sequences for comparison can 
be conducted, e.g., by the local homology algorithm of Smith and Waterman (1981), Adv. 
Appl. Math. 2:482, by the homology alignment algorithm of Needleman and Wunsch (1970), 
J. Mol. Biol. 48:443, by the search for similarity method of Pearson and Lipman (1988), Proc. 
Natl. Acad. Sci. USA 85:2444-2448, by computerized implementations of these algorithms, or 
by manual alignment and visual inspection. Other algorithms for determining homology or 
identity include, for example, the BLAST program (Basic Local Alignment Search Tool, 
National Center for Biotechnology Information), BESTFIT, FASTA, and TFASTA (Wisconsin 
Genetics Software Package, Genetics Computer Group, Madison, WI), ALIGN, AMAS 
(Analysis of Multiply Aligned Sequences), AMPS (Alignment of Multiple Protein Sequence), 
ASSET (Aligned Segment Statistical Evaluation Tool), BANDS, BESTSCOR, BIOSCAN 
(Biological Sequence Comparative Analysis Node), BLIMPS (BLocks IMProved Searcher), 
Intervals and Points, BMB, CLUSTAL V, CLUSTAL W, CONSENSUS, LCONSENSUS, 
WCONSENSUS, Smith-Waterman algorithm, DARWIN, Las Vegas algorithm, FNAT 
(Forced Nucleotide Alignment Tool), Framealign, Framesearch, DYNAMIC, FILTER, FSAP 
(Fristensky Sequence Analysis Package), GAP (Global Alignment Program), GENAL, 
GIBBS, GenQuest, ISSC (Sensitive Sequence Comparison), LALIGN (Local Sequence 
Alignment), LCP (Local Content Program), MACAW (Multiple Alignment Construction and 
Analysis Workbench), MAP (Multiple Alignment Program), MBLKP, MBLKN, PIMA 
(Pattern-Induced Multi-sequence Alignment), SAGA (Sequence Alignment by Genetic 
Algorithm) and WHAT-IF. Such alignment programs can also be used to screen genome 
databases to identify polynucleotide sequences having substantially identical sequences. A 
number of genome databases are available, for example, a substantial portion of the human 
genome is available as part of the Human Genome Sequencing Project (J. Roach, 
http://weber.u.Washington.edu/~roach/human_genome_progress2.html) (Gibbs, 1995). At 
least twenty-one other genomes have already been sequenced, including, for example, M. 
genitalium (Fraser et al., 1995), M. jannaschii (Bult et al., 1996), H. influenzae (Fleischmann 
et al., 1995), E. coli (Blattner et al., 1997), yeast (S. cerevisiae) (Mewes et al., 1997), and 
D. melanogaster (Adams et al., 2000). Significant progress has also been made in 
sequencing the genomes of model organisms, such as mouse, C. elegans, and Arabidopsis sp. 
Several databases containing genomic information annotated with some functional 
information are maintained by different organizations, and are accessible via the internet, for 
example, http://www.tigr.org/tdb; http://www.genetics.wisc.edu; 
http://genome-www.stanford.edu/~ball; http://hiv-web.lanl.gov; http://www.ncbi.nlm.nih.gov; 
http://www.ebi.ac.uk; http://Pasteur.fr/other/biology; and http://www.genome.wi.mit.edu. 
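
The local homology algorithm of Smith and Waterman cited above can be sketched in a few lines. The version below is an illustration under stated assumptions, not the implementation found in any of the listed packages: it returns only the best local alignment score, uses a linear gap penalty, and the scoring values (match=2, mismatch=-1, gap=-2) are arbitrary defaults chosen for the example.

```python
def smith_waterman_score(a: str, b: str, match: int = 2,
                         mismatch: int = -1, gap: int = -2) -> int:
    """Best local alignment score between a and b (linear gap penalty)."""
    prev = [0] * (len(b) + 1)   # previous row of the dynamic-programming matrix
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diagonal = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A cell never drops below zero, which is what makes the alignment local.
            curr[j] = max(0, diagonal, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


if __name__ == "__main__":
    print(smith_waterman_score("TTACGTT", "GGACGGG"))
```

Keeping only two rows of the matrix is sufficient when the score alone is needed; recovering the aligned segments themselves would require the full matrix and a traceback.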
[000257] Examples of useful algorithms are the BLAST and the BLAST 2.0 algorithms, 
which are described in Altschul et al. (1990), J. Mol. Biol. 215:403-410, and Altschul et 
al. (1997), Nuc. Acids Res. 25:3389-3402, respectively. Software for performing BLAST 
analyses is publicly available through the National Center for Biotechnology Information 
(http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring 
sequence pairs (HSPs) by identifying short words of length W in the query sequence, which 
either match or satisfy some positive-valued threshold score T when aligned with a word of 
the same length in a database sequence. T is referred to as the neighborhood word score 
threshold (Altschul et al., supra). These initial neighborhood word hits act as seeds for 
initiating searches to find longer HSPs containing them. The word hits are extended in both 
directions along each sequence for as far as the cumulative alignment score can be increased. 
Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward 
score for a pair of matching residues; always >0) and N (penalty score for mismatching 
residues; always <0). For amino acid sequences, a scoring matrix is used to calculate the 
cumulative score. Extension of the word hits in each direction is halted when: the 
cumulative alignment score falls off by the quantity X from its maximum achieved value; the 
cumulative score goes to zero or below, due to the accumulation of one or more negative- 
scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm 
parameters W, T, and X determine the sensitivity and speed of the alignment. For nucleotide 
sequences, the BLASTN program uses as defaults a wordlength (W) of 11, an expectation (E) 
of 10, M=5, N=-4 and a comparison of both strands. For amino acid sequences, the BLASTP 
program uses as defaults a wordlength (W) of 3, an expectation (E) of 10, and the 
BLOSUM62 scoring matrix (see Henikoff and Henikoff (1989), Proc. Natl. Acad. Sci. USA 
89:10915). 
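
The three halting conditions for word-hit extension described above can be sketched for the nucleotide case. This is a simplified, ungapped illustration and not the BLAST source code: the function name and argument layout are hypothetical, only rightward extension is shown, M=5 and N=-4 follow the BLASTN defaults quoted in the text, and the drop-off X=20 is an assumed value because the text does not state a default for X.

```python
def extend_right(query: str, subject: str, q_pos: int, s_pos: int,
                 seed_score: int, M: int = 5, N: int = -4, X: int = 20):
    """Extend a word hit to the right without gaps.

    q_pos and s_pos are the first positions after the seed word in each sequence.
    Returns (best cumulative score, number of residues kept in the extension).
    """
    score = best = seed_score
    best_len = 0
    i = 0
    # Halting condition: the end of either sequence is reached.
    while q_pos + i < len(query) and s_pos + i < len(subject):
        score += M if query[q_pos + i] == subject[s_pos + i] else N
        i += 1
        if score > best:
            best, best_len = score, i
        # Halting conditions: the score goes to zero or below, or falls off
        # by the quantity X from its maximum achieved value.
        if score <= 0 or best - score >= X:
            break
    return best, best_len


if __name__ == "__main__":
    # Seed word "ACGT" (4 matches, score 4 * 5 = 20), extension starts at index 4.
    print(extend_right("ACGTACGTAA", "ACGTACGTCC", 4, 4, seed_score=20))
```

A full search would run the same extension to the left of the seed and add the two gains to obtain the score of the high scoring sequence pair.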

[000258] The BLAST algorithm also performs a statistical analysis of the similarity 
between two sequences (see, e.g., Karlin and Altschul (1993), Proc. Natl. Acad. Sci. USA 
90:5873). One measure of similarity provided by the BLAST algorithm is the smallest sum 
probability (P(N)), which provides an indication of the probability by which a match between 
two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid 
is considered similar to a reference sequence if the smallest sum probability in a comparison 
of the test nucleic acid to the reference nucleic acid is less than about 0.2, less than about 
0.01, or less than about 0.001. 

[000259] In one aspect, protein and nucleic acid sequence homologies are evaluated using 
the Basic Local Alignment Search Tool ("BLAST"). In particular, five specific BLAST 
programs are used to perform the following tasks: 

(1) BLASTP and BLAST3 compare an amino acid query sequence against 
a protein sequence database; 

(2) BLASTN compares a nucleotide query sequence against a nucleotide 
sequence database; 

(3) BLASTX compares the six-frame conceptual translation products of a 
query nucleotide sequence (both strands) against a protein sequence database; 

(4) TBLASTN compares a query protein sequence against a nucleotide 
sequence database translated in all six reading frames (both strands); and 

(5) TBLASTX compares the six-frame translations of a nucleotide query 
sequence against the six-frame translations of a nucleotide sequence database. 
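
The "six-frame conceptual translation" used by BLASTX, TBLASTN and TBLASTX in the list above can be illustrated as follows. This is a sketch, not code from the BLAST programs: the function names and the frame labels ("+1" to "-3") are hypothetical conventions, the standard genetic code is assumed, stop codons are written as "*", and any codon containing a character other than A, C, G or T is written as "X".

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (last base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Sequence of the opposite strand, read 5' to 3'."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate complete codons from the start of dna; a trailing partial codon is dropped."""
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(0, len(dna) - 2, 3))


def six_frame_translations(dna: str) -> dict:
    """Translate three reading frames on each strand."""
    dna = dna.upper()
    frames = {}
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(seq[offset:])
    return frames


if __name__ == "__main__":
    print(six_frame_translations("ATGGCC"))
```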

[000260] The BLAST programs identify homologous sequences by identifying similar 
segments, which are referred to herein as "high-scoring segment pairs," between a query 
amino or nucleic acid sequence and a test sequence which may be obtained from a protein or 
nucleic acid sequence database. High-scoring segment pairs are identified (i.e., aligned) by 
means of a scoring matrix, many of which are known in the art. In one example, the scoring 
matrix used is the BLOSUM62 matrix (Gonnet et al. (1992), Science 256:1443-1445; 
Henikoff and Henikoff (1993), Proteins 17:49-61). In another example, the PAM or 
PAM250 matrices may also be used (see, e.g., Schwartz and Dayhoff, eds. (1978), Matrices 
for Detecting Distance Relationships: Atlas of Protein Sequence and Structure, Washington: 
National Biomedical Research Foundation). BLAST programs are accessible through the 
U.S. National Library of Medicine, e.g., at www.ncbi.nlm.nih.gov. 

[000261] The parameters used with the above algorithms may be adapted depending on the 
sequence length and degree of homology studied. In some aspects, the parameters may be 
the default parameters used by the algorithms in the absence of instructions from the user. 
[000262] In a particular aspect, the invention provides a method for modifying small 
molecules, comprising contacting a polypeptide encoded by a polynucleotide described 
herein or enzymatically active fragments thereof with a small molecule to produce a modified 
small molecule. A library of modified small molecules is tested to determine if a modified 
small molecule is present within the library which exhibits a desired activity. A specific 
biocatalytic reaction which produces the modified small molecule of desired activity is 
identified by systematically eliminating each of the biocatalytic reactions used to produce a 
portion of the library, and then testing the small molecules produced in the portion of the 
library for the presence or absence of the modified small molecule with the desired activity. 
The specific biocatalytic reactions which produce the modified small molecule of desired 
activity are optionally repeated. The biocatalytic reactions are conducted with a group of 
biocatalysts that react with distinct structural moieties found within the structure of a small 
molecule; each biocatalyst is specific for one structural moiety or a group of related structural 
moieties; and each biocatalyst reacts with many different small molecules which contain the 
distinct structural moiety. 
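
The elimination strategy described above, in which each biocatalytic reaction is left out in turn and the resulting portion of the library is retested, can be sketched as a leave-one-out loop. Everything named here is hypothetical: `library_has_activity` stands in for whatever wet-lab assay reports the desired activity, and the reaction labels in the usage example are placeholders rather than reactions named in this document.

```python
from typing import Callable, List, Sequence


def find_required_reactions(reactions: Sequence[str],
                            library_has_activity: Callable[[List[str]], bool]) -> List[str]:
    """Return the reactions whose elimination abolishes the desired activity."""
    required = []
    for reaction in reactions:
        # Produce the library portion without this one reaction, then retest it.
        remaining = [r for r in reactions if r != reaction]
        if not library_has_activity(remaining):
            required.append(reaction)
    return required


if __name__ == "__main__":
    # Placeholder assay: activity is present only if "reaction_B" was used.
    def assay(used: List[str]) -> bool:
        return "reaction_B" in used

    print(find_required_reactions(["reaction_A", "reaction_B", "reaction_C"], assay))
```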

[000263] Some aspects of the use of the nitrilases are: 

α-hydroxy acid - Nitrilases produce α-hydroxy acids through hydrolysis of 
cyanohydrins. Production of mandelic acid and derivatives thereof is an example of this. A 
significant application of this type involves commercial production of (R)-mandelic acid in 
both high yield and high enantioselectivity from mandelonitrile. Mandelic acid and 
derivatives have found broad application as intermediates and resolving agents for the 
production of many chiral pharmaceutical and agricultural products. Previous attempts to 
employ the few known nitrilases in processes using analogous substrates have been plagued 
by significantly lower activity, productivity, and selectivity. 
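
Enantioselectivity is reported in this document as percent enantiomeric excess (ee). As a reminder of how that figure relates to the amounts of the two enantiomers, a minimal sketch follows; the function name is hypothetical and the inputs may be any proportional measure of the two enantiomers, such as chromatographic peak areas.

```python
def enantiomeric_excess(major: float, minor: float) -> float:
    """Percent ee from the amounts of the two enantiomers."""
    total = major + minor
    if total <= 0:
        raise ValueError("the combined amount must be positive")
    return 100.0 * abs(major - minor) / total


if __name__ == "__main__":
    # A 99:1 mixture of enantiomers corresponds to 98% ee.
    print(enantiomeric_excess(99, 1))
```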



[Reaction scheme not reproduced. Recoverable labels: SEQ ID NOS: (numbers not legible); 98% ee, complete conversion; 98% ee, 99% ee, 98% ee.] 



Phenyllactic acid derivatives 

[000264] An additional application is in the production of (S)-phenyl lactic acid derivatives 
in both high yield and high enantioselectivity. Phenyl lactic acid derivatives have found 
broad application in the production of many chiral pharmaceutical and agricultural products. 



[Reaction scheme not reproduced. Recoverable labels: SEQ ID NOS: (numbers not legible); (S), 98% ee, complete conversion; 97%, 96%, 97%.] 



β-hydroxy acid 

[000265] With important commercial considerations, nitrilases are provided that produce 
either enantiomer of 4-cyano-3-hydroxybutyric acid, the (R)-enantiomer of which is a key 
intermediate in the synthesis of the drug LIPITOR™. 
[Reaction scheme not reproduced. Recoverable labels: SEQ ID NOS:109, 110; selected nitrilases; (S)-3-hydroxy-4-cyano-butyric acid; LIPITOR.] 



[000266] The following nitrilases are more examples of nitrilases useful in converting 
hydroxyglutarylnitrile to (R)-3-hydroxy-4-cyano-butyric acid: SEQ ID NOS:205, 206, SEQ 
ID NOS:207, 208, SEQ ID NOS:195, 196, SEQ ID NOS:43, 44, SEQ ID NOS:321, 322, and 
SEQ ID NOS:237, 238. The above schematic indicates that "selected nitrilases" can be used 
to convert hydroxyglutarylnitrile to (S)-3-hydroxy-4-cyano-butyric acid: SEQ ID NOS:107, 
108, SEQ ID NOS:109, 110, SEQ ID NOS:111, 112, SEQ ID NOS:127, 128, SEQ ID 
NOS:129, 130, SEQ ID NOS:133, 134, SEQ ID NOS:113, 114, SEQ ID NOS:145, 146, SEQ 
ID NOS:101, 102, SEQ ID NOS:179, 180, SEQ ID NOS:201, 202, SEQ ID NOS:159, 160, 
SEQ ID NOS:177, 178, SEQ ID NOS:181, 182, SEQ ID NOS:183, 184, SEQ ID NOS:185, 
186, SEQ ID NOS:57, 58, SEQ ID NOS:197, 198, SEQ ID NOS:59, 60, SEQ ID NOS:67, 
68, and SEQ ID NOS:359, 360. 

[000267] The invention will be further described with reference to the following examples; 
however, it is to be understood that the invention is not limited to such examples. Rather, in 
view of the present disclosure which describes the current best mode for practicing the 
invention, many modifications and variations would present themselves to those of skill in 
the art without departing from the scope and spirit of this invention. All changes, 
modifications, and variations coming within the meaning and range of equivalency of the 
claims are to be considered within their scope. 



EXAMPLES 
[000268] For each library to be screened for nitrilases, an infection was set up as follows: 
5ml of an OD600nm=1 resuspension of SEL700 cells and 1ml of the phagemid library to be 
screened were combined. The combination was incubated in a 37°C waterbath for 45 min. 

[000269] Using the infection, serial dilutions were made in 10mM MgSO4, using 10µl 
aliquots of the infection. 

titer of library     dilutions to make 
~10^5 cfu/ml         10^-1 dilution 
~10^6 cfu/ml         10^-1, 10^-2 dilution 
~10^7 cfu/ml         10^-1, 10^-2, 10^-3 dilution 

[000270] 60µl of each of the following dilutions were deposited onto a small LB-kan50 
plate: 

titer of library     dilutions to make 
~10^5 cfu/ml         undiluted infection, 10^-1 dilution 
~10^6 cfu/ml         10^-1, 10^-2 dilutions 
~10^7 cfu/ml         10^-2, 10^-3 dilutions 
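
The plating scheme above can be checked with a short calculation of the colony count expected on each small plate. This is a rough sketch under a stated simplification: the quoted library titer is treated as the titer of the infection itself, and the further dilution that arises from combining 1ml of library with 5ml of cells is ignored. The function name is hypothetical.

```python
def expected_colonies(titer_cfu_per_ml: float, dilution: float,
                      plated_volume_ul: float = 60.0) -> float:
    """Approximate colony count on a plate for a given titer, dilution and plated volume."""
    return titer_cfu_per_ml * dilution * plated_volume_ul / 1000.0


if __name__ == "__main__":
    plan = {
        1e5: [1.0, 1e-1],    # undiluted infection, 10^-1 dilution
        1e6: [1e-1, 1e-2],
        1e7: [1e-2, 1e-3],
    }
    for titer, dilutions in plan.items():
        counts = [round(expected_colonies(titer, d)) for d in dilutions]
        print(f"{titer:.0e} cfu/ml ->", counts, "colonies per plate")
```

Under this simplification each titer class yields on the order of 6000 and 600 colonies on its two plates, which is consistent with the dilutions having been chosen to give comparable plate densities across libraries.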

[000271] The cells in the infection were centrifuged in a tabletop centrifuge at 4°C, 4.6k 
rpm, 10 min to form pellets. The supernatant was decanted from the resulting pellets. The 
cells were resuspended in residual liquid. All of the resuspended cells were deposited onto a 
single large LB-kan50 plate. All plates were incubated at 30°C overnight. 

Example 2: Selection Screenings 

[000272] The cells of each infection plate were resuspended with ~4ml 10mM MgSO4. 
The resuspensions were placed in a tube. The remaining cells on each plate were 
resuspended with ~3ml 10mM MgSO4 and combined with the first resuspension from the 
same plate. The volume of each tube was brought to 12ml with 10mM MgSO4. The tubes 
were vortexed vigorously. The tubes were centrifuged in a tabletop centrifuge at 4°C and 
4.6k rpm for 10min to form pellets. The supernatant was decanted from each resuspension. The 
washed cells in each tube were resuspended with 10ml 10mM MgSO4. The resuspensions 
from each library were stored at 4°C until the selection cultures were ready to be set up. 

[000273] For each resuspension, selection cultures were set up using the following process: 
1) The nitrilase selection medium was prepared, using: 1X M9 medium with 0.2% 
glucose, no nitrogen and 50µg/ml kanamycin (for pBK phagemid libraries only; 
use ampicillin for pBS libraries). 

2) 5ml of the medium was aliquoted into a 50ml screw top conical tube. 

3) 25pl of the stored resuspension was added to the tube. 

4) 5pl of adiponitrile was added to the tube, to bring the final concentration to 
8.8mM. Additional nitrile substrates may be used, in place of adiponitrile. 

5) The resulting combination was cultured at 30°C. 

Steps 1-5 were repeated for each nitrile substrate. 
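
The 8.8 mM figure in step 4 can be checked with a short calculation. The sketch below assumes neat liquid adiponitrile with a molecular weight of 108.14 g/mol and a density of about 0.951 g/ml; both constants are handbook-style assumptions and are not stated in the text, and the function name is illustrative.

```python
ADIPONITRILE_MW_G_PER_MOL = 108.14     # C6H8N2 (assumed, not given in the text)
ADIPONITRILE_DENSITY_G_PER_ML = 0.951  # approximate value near 25 °C (assumed)


def final_conc_mM(added_ul: float, medium_ml: float,
                  density_g_per_ml: float, mw_g_per_mol: float) -> float:
    """Final concentration (mM) after adding a neat liquid to a volume of medium."""
    mass_mg = added_ul * density_g_per_ml          # µl x g/ml = mg
    amount_mmol = mass_mg / mw_g_per_mol           # mg / (g/mol) = mmol
    total_volume_ml = medium_ml + added_ul / 1000  # small volume correction
    return amount_mmol / total_volume_ml * 1000    # mmol/ml = M, so x1000 gives mM


conc = final_conc_mM(5.0, 5.0, ADIPONITRILE_DENSITY_G_PER_ML, ADIPONITRILE_MW_G_PER_MOL)
print(f"adiponitrile: {conc:.1f} mM")
```

For a different nitrile substrate, the same function applies once that substrate's own density and molecular weight are substituted.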
Example 3: Isolation of a positive nitrilase clone from selection cultures 

[000274] Ten (10) µl of selection culture with growth was streaked out onto a small
LB-kan50 plate and allowed to grow for 2 nights at 30°C. Five isolated cfu were picked and each
was grown in 2 ml nitrilase selection medium at 30°C. Each culture was monitored (where
growth indicates positive cfu was picked), and was removed when monitoring indicated that
it was in a stationary phase of growth. One (1) ml of culture was used to do a plasmid
preparation and was eluted with 40 µl elution buffer. Five to eight (5-8) µl DNA was cut with
Pst I/Xho I or Sac I/Kpn I restriction enzymes to remove insert from vector. A restriction 
fragment length polymorphism (RFLP) determination was carried out to identify the size of 
the insert. The insert was sequenced. 

Example 4: Screening and Characterization of Nitrilases 

[000275] Nitrilases of the invention were screened against target substrates. Of those
showing hydrolytic activity in a primary screen, enzymes with enantioselectivities above 20%
enantiomeric excess (ee) were selected for further characterization. Those enzymes were
selected based on: 1) having activity against one of the substrates of interest and 2)
exhibition of greater than 35% ee (enantiomeric excess). The results of this screening
process are set forth in Table 1 above. The products used for screening were:
D-phenylglycine, L-phenyllactic acid, (R)-2-chloromandelic acid, (S)-cyclohexylmandelic
acid, L-2-methylphenylglycine, (S)-2-amino-6-hydroxy hexanoic acid, and
4-methyl-L-leucine.
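
For reference, enantiomeric excess is conventionally computed from the amounts (or chiral HPLC peak areas) of the two enantiomers as ee = 100 x (target - other) / (target + other). The sketch below applies that standard definition to the 20% and 35% thresholds named above; the function names and the peak areas are hypothetical.

```python
def ee_percent(target_area: float, other_area: float) -> float:
    """Enantiomeric excess (%) in favour of the target enantiomer.

    A negative result means the undesired enantiomer predominates.
    """
    total = target_area + other_area
    if total <= 0:
        raise ValueError("no product detected")
    return 100.0 * (target_area - other_area) / total


def passes(target_area: float, other_area: float, threshold_percent: float) -> bool:
    """True if the ee strictly exceeds the given threshold."""
    return ee_percent(target_area, other_area) > threshold_percent


# Hypothetical chiral HPLC peak areas for one enzyme.
print(ee_percent(80.0, 20.0))    # 60.0
print(passes(80.0, 20.0, 35.0))  # True
print(passes(60.0, 40.0, 20.0))  # False (ee is exactly 20%, not above it)
```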

[Reaction scheme: phenylglycinonitrile to D-phenylglycine]



[000276] The hydrolysis of phenylglycinonitrile was performed. Some of these enzymes
showed an ee higher than 20% and those were selected for preliminary characterization.

[000277] Based on the preliminary characterization experiments, a number of putative hits
were identified on phenylglycinonitrile and a large amount of data was accumulated on these
enzymes. The data revealed many common properties: the majority of the enzymes had pH
optima for activity at pH 7 and, in general, the enantioselectivity was enhanced at the lower
pH values. The enzymes were found to be more active at higher temperature, particularly
38°C, although this temperature often resulted in lower enantioselectivities. The use of
water-miscible co-solvents in the reaction was shown to be a practical option. The inclusion
of 10-25% methanol (v/v) in the enzyme reactions did not substantially affect enzyme
activity and, in many cases, led to an increase in enantioselectivity. The use of biphasic
systems has also shown some promise, with the enzymes maintaining their level of activity
with the addition of up to 70% (v/v) of hexane and, in some cases, toluene. The use of ethyl
acetate in the biphasic systems, however, led to lower activity.

[000278] Of the enzymes identified active on phenylglycinonitrile, the enantioselectivity of
several enzymes was shown to remain above the success criterion of 35% ee. The
preliminary characterization data indicated that some of the enzymes exhibited high
enantioselectivities for D-phenylglycine, with corresponding conversion to product of
40-60%. Further investigation suggested that the rate of activity of some of these enzymes was
faster than the rate of racemization of the substrate. Reducing the concentration of enzyme
led to improved enantioselectivity; therefore, it appears that some benefit could be gained by
control of the relative rates of the chemical racemization and the enzyme activity.

Screening of nitrilases against target substrate (R)-2-chloromandelic acid

[Reaction scheme: 2-chloromandelonitrile to (R)-2-chloromandelic acid]



[000279] Enzymes were identified which showed activity on 2-chloromandelonitrile.
A high degree of overlap existed between the enzymes which were
active on 2-chloromandelonitrile and phenylglycinonitrile. Many of these enzymes also
formed a distinct sequence family.

[000280] Higher temperatures and neutral pH appeared to lead to the
highest activity for the active enzymes. For the majority of the nitrilases, the
enantioselectivity also increased at higher temperatures, particularly 38°C. The enzymes
retained their activity in the presence of up to 25% methanol or 10% isopropanol; in many of
these cases, the enantioselectivity was also enhanced. Activity in biphasic systems was
largely comparable to aqueous conditions, particularly with hexane as the non-aqueous phase;
varying tolerances to toluene were observed between the different nitrilases.

Table 2. Summary of optimal conditions determined from characterization experiments
for enantioselective hydrolysis of 2-chloromandelonitrile.

SEQ ID NOS:   Optimum pH   Optimum Temp °C   Solvent Tolerance

385, 386      7            38                25% MeOH
169, 170      5            38                25% MeOH, 10% IPA
185, 186      7            38                25% MeOH, 10% IPA
47, 48        7            38                10% MeOH
197, 198      6            55                25% MeOH, 10% IPA
187, 188      7            38                10% MeOH, 40% IPA
217, 218      7            38                25% MeOH, 10% IPA, 70% hexane, 40% toluene
55, 56        7            38                10% MeOH, IPA, 70% hexane
167, 168      9            38                10% MeOH, IPA, 70% hexane
15, 16        7            38                25% MeOH, 10% IPA, 70% hexane, 40% toluene



Screening of nitrilases against target substrate (S)-phenyllactic acid:

[Reaction scheme: phenylacetaldehyde cyanohydrin to (S)-phenyllactic acid]



[000281] Many of the nitrilases tested were active on phenylacetaldehyde cyanohydrin.
Many of these enzymes were part of two related sequence families and were distinct from
those enzymes that were active on phenylglycinonitrile and chloromandelonitrile.

[000282] The pH optima of the enzymes were generally above pH 7 (i.e. pH 8 or 9), with
higher enantioselectivities being exhibited at these levels. Most of the enzymes showed
superior activity at higher temperature, particularly 38°C. The effect of temperature on the
enantioselectivities of the enzymes varied; in most cases, this property was slightly lower at
higher temperatures. While the enzymes were tolerant towards the addition of co-solvents,
particularly 10% (v/v) methanol, no advantage in activity or enantioselectivity was gained by
such additions. The use of a biphasic system was again shown to be feasible.

[000283] Table 3. Summary of optimal conditions determined from characterization
experiments for enantioselective hydrolysis of phenylacetaldehyde cyanohydrin



SEQ ID NOS:   Optimum pH   Optimum Temp °C   Solvent Tolerance

103, 104      7            55                10% MeOH, IPA
99, 100       8            38                10% MeOH, 70% hexane, toluene
183, 184      9            38                10% MeOH, IPA, 70% toluene, hexane
173, 174      5            38-55             25% MeOH, IPA, 70% hexane, toluene
213, 214      7            38                10% MeOH, 25% IPA, 70% hexane, toluene
61, 62        7            38                10% MeOH, 70% hexane, toluene
205, 206      8            38-55             10% MeOH, IPA, 40% hexane, toluene
207, 208      8            38                hexane
209, 210      8            38                10% MeOH, 40% hexane, toluene
195, 196      8            38                10% MeOH, 40% hexane, toluene
43, 44        9            38                10% MeOH, 40% hexane
161, 162      9            38                25% MeOH, IPA, 10% hexane, toluene
175, 176      6            38-55             10% MeOH, IPA, 40% hexane
293, 294      6            38                10% MeOH, IPA, 40% hexane


Screening of nitrilases against target substrate L-2-methylphenylglycine

[Reaction scheme: 2-methylphenylglycinonitrile to L-2-methylphenylglycine]

[000284] Nitrilases have shown activity on this substrate and preferentially yielded the
D-2-methylphenylglycine, rather than the required L-2-methylphenylglycine.

Screening of nitrilases against target substrate L-hydroxynorleucine ((S)-2-amino-6-hydroxy
hexanoic acid)

[Reaction scheme: 5-hydroxypentanal aminonitrile to L-hydroxynorleucine]

[000285] A number of nitrilases, which showed activity on 2-amino-6-hydroxy 
hexanenitrile, were isolated. All of these enzymes showed enantioselectivity towards the L- 
isomer of the product. 

[000286] The enzymes all showed higher enantioselectivities at higher pH and appeared to
be more susceptible to the addition of solvents than the other nitrilases tested. Although activity
was detected in the presence of organic solvents, it was generally lower than that of the
aqueous control. Once again, the activity of the enzymes was negatively affected by the acid
product and aldehyde starting material.

Table 4. Summary of optimal conditions determined from characterization experiments for 
enantioselective hydrolysis of 2-amino-6-hydroxy hexanenitrile. 

SEQ ID NOS:   Optimum pH   Optimum Temp °C   Solvent

217, 218      9            38                10% MeOH
55, 56        9            38                None
187, 188      9            38                10% MeOH
167, 168      9            38                None
221, 222      9            38





[000287] A range of hydrolytic activities was observed among the confirmed hit enzymes
for 2-amino-6-hydroxy hexanenitrile.

Screening of nitrilases against target substrate 4-methyl-D-leucine and 4-methyl-L-leucine

[Reaction scheme: 3,3-dimethylbutanal aminonitrile to 4-methyl-L-leucine and 4-methyl-D-leucine]

[000288] Hydrolysis of 2-amino-4,4-dimethyl pentanenitrile was performed by several of 
the nitrilases. Of these, some were shown to hydrolyse the nitrile to the L-isomer of the 
corresponding acid and were selected for further characterization. 

Table 5. Summary of optimal conditions determined from characterization experiments for 
enantioselective hydrolysis of 2-amino-4,4-dimethyl pentanenitrile 



SEQ ID NOS:   Optimum pH   Optimum Temp °C   Solvent Tolerance

103, 104      7            23                25% MeOH, 10% IPA
59, 60        8            23                25% MeOH
221, 222      6            38                25% MeOH, 10% IPA



Screening of nitrilases against target substrate (S)-cyclohexylmandelic acid

[Reaction scheme: cyclohexylmandelonitrile to (S)-cyclohexylmandelic acid]

Screening of nitrilases against target substrate Mandelonitrile

[Reaction scheme: mandelonitrile to (R)-mandelic acid]

[000289] The nitrilase collection was also screened on mandelonitrile. The nitrilases
actively hydrolyzed both phenylglycinonitrile and chloromandelonitrile.

Enzymatic assay for determination of enantioselectivity 

[000290] In the design of a spectroscopic system for determination of the chiral α-hydroxy
acids and α-amino acids, an enzyme based assay which permits the detection of product
formation and enantioselectivity was developed and used.

[000291] Spectroscopic systems for the detection of α-hydroxy acids and α-amino acids
based on lactate dehydrogenase (L-LDH & D-LDH) and on amino acid oxidase (L-AA Oxid
& D-AA Oxid) are described in Figures 6 and 7. These enzymes were chosen because they
are reported to have reasonably broad substrate ranges while still retaining near absolute
enantiospecificity.

[000292] The overall feasibility of this system has been established (Table 12). Neither the
parent hydroxynitrile nor the aminonitrile is metabolized by the secondary or detection
enzyme and thus starting material does not interfere. Cell lysate which is not heat treated
results in background activity for the LDH system; however, heat inactivation eliminates the
background activity. Cell lysate does not appear to interfere in the AA Oxidase assay. One
concern is the inactivation of the AA Oxidase, which utilizes a FMN co-factor, by residual
cyanide. However, the control studies indicated that at 2 mM PGN (which could release up to
2 mM HCN) inactivation is not a problem. This assay is suitable for automation in 384-well
(or possibly greater density) microtiter plates.

Table 6: Summary of Identification of Secondary Enzyme to Chiral Detection of Acid
Product.

SUBSTRATE                                  ENZYME WITH SUITABLE ACTIVITY
                                           FOUND FROM COMMERCIAL SOURCE

Hydroxy Acid Products:
L-lactic acid                              YES
D-lactic acid                              YES
L-phenyl lactic acid                       YES
D-phenyl lactic acid                       YES
S-cyclohexylmandelic acid (1)              Not applicable
R-cyclohexylmandelic acid (1)              Not applicable

Amino Acid Products:
4-methyl-L-leucine                         YES
4-methyl-L/D-leucine                       YES (D-unknown)
D-phenylalanine                            YES
R-phenylglycine                            YES
L-homophenyllactic acid                    YES
D-homophenyllactic acid                    YES
L-homophenylalanine                        YES
D-homophenylalanine                        YES
(S)-2-amino-6-hydroxy hexanoic acid        YES
(R/S)-2-amino-6-hydroxy hexanoic acid      YES (D-unknown)
L-methylphenylglycine (1)                  Not applicable
D-methylphenylglycine (1)                  Not applicable

(1) ... are not amenable to this particular oxidation

Example 5: Standard assay conditions

[000293] The following solutions were prepared: 

• Substrate stock solution: 50 mM of the aminonitrile substrate in 0.1 M phosphate
buffer (pH 7) or 50 mM of the cyanohydrin substrate in 0.1 M Na Acetate buffer (pH 5)

• Enzyme stock solution: 3.33 ml of 0.1 M phosphate buffer (pH 7) was added to each vial of 20
mg of lyophilized cell lysate (final concentration 6 mg protein/ml)



[000294] Procedure: 

• Add 100 µl of the 50 mM substrate solution to the appropriate number of wells of a
96-well plate

• Add 80 µl of buffer to each well

• Add 20 µl of enzyme solution to each well

• Blank controls were set up by substitution of 20 µl of buffer for the enzyme solution

• Negative controls consisting of 20 µl of enzyme solution in 180 µl of buffer were
also included in many of the experiments. Once it had been established that the cell lysate
did not interfere with the detection of the products, these controls were not included.
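
The final concentrations in each well follow directly from the volumes listed above. The following minimal sketch works them through; the constant and function names are illustrative only, and the figures it yields agree with the 25 mM substrate and 0.6 mg/ml protein stated in Example 10.

```python
SUBSTRATE_STOCK_MM = 50.0               # substrate stock concentration, mM
ENZYME_STOCK_MG_PER_ML = 20.0 / 3.33    # 20 mg lysate in 3.33 ml buffer, about 6 mg/ml

SUBSTRATE_UL = 100.0
BUFFER_UL = 80.0
ENZYME_UL = 20.0
TOTAL_UL = SUBSTRATE_UL + BUFFER_UL + ENZYME_UL  # 200 µl per well


def final_concentration(stock_conc: float, added_ul: float, total_ul: float) -> float:
    """Concentration of a component after dilution into the total well volume."""
    return stock_conc * added_ul / total_ul


substrate_mM = final_concentration(SUBSTRATE_STOCK_MM, SUBSTRATE_UL, TOTAL_UL)
protein_mg_per_ml = final_concentration(ENZYME_STOCK_MG_PER_ML, ENZYME_UL, TOTAL_UL)

print(f"substrate: {substrate_mM:.1f} mM")          # 25.0 mM
print(f"protein:   {protein_mg_per_ml:.2f} mg/ml")  # 0.60 mg/ml
```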
[000295] Sampling of reactions:

• The reactions were sampled by removing an aliquot from each well (15-50 µl) and
diluting the samples as follows:

• Samples for non-chiral HPLC analysis:

• Phenylglycine, 2-chloromandelic acid and phenyllactic acid: initially, the samples
were diluted 2-fold with water and a further 2-fold with methanol or acetonitrile (final
dilution: 4-fold). It was found that an 8-fold dilution of these samples led to improved
chromatographic separation

• (S)-2-amino-6-hydroxy hexanoic acid, 4-methylleucine, tert-leucine,
2-methylphenylglycine and cyclohexylmandelic acid: samples were diluted 1:1 with methanol
or acetonitrile. The choice of solvent was based on the solvent used in the HPLC analysis
method.

[000296] • Samples for chiral HPLC analysis:

• Phenylglycine, 2-chloromandelic acid and phenyllactic acid: as described above for
the non-chiral analyses, the samples for chiral analyses were initially diluted 2-fold and in the
later stages of the project, at 4-fold.

• (S)-2-amino-6-hydroxy hexanoic acid, 4-methylleucine, tert-leucine,
2-methylphenylglycine: samples were diluted 1:1 with methanol or acetonitrile.

[000297] • For each experiment, a standard curve of the product was included in the HPLC
run. The curve was plotted on an X-Y axis and the concentration of product in the samples
calculated from the slope of these curves.
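
One way to carry out the standard-curve calculation is an ordinary least-squares line through the standards, followed by back-calculation of the sample concentration and correction for the dilution made before injection. The sketch below is an assumption about how such a calculation could be done, not the procedure actually used; the standard concentrations and peak areas are invented for illustration.

```python
def fit_line(xs, ys):
    """Least-squares slope and intercept for peak area (y) versus concentration (x)."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need at least two paired standards")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("standards must span more than one concentration")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def concentration_from_area(area, slope, intercept, dilution_factor=1.0):
    """Concentration in the undiluted reaction, given the injected sample's peak area."""
    return (area - intercept) / slope * dilution_factor


# Hypothetical standards: concentration (mM) and peak area (arbitrary units).
std_conc = [0.0, 5.0, 10.0, 20.0]
std_area = [0.0, 50.0, 100.0, 200.0]
slope, intercept = fit_line(std_conc, std_area)

# A sample diluted 4-fold before injection that gives a peak area of 50.
print(concentration_from_area(50.0, slope, intercept, dilution_factor=4.0))
```

The back-calculation is only meaningful for peak areas that fall within the range spanned by the standards.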

[000298] • For the preliminary characterization experiments, samples were taken such that
the activity of the enzymes was in the linear phase; this was performed so that differences in
the effects of the parameters on the rate of reaction, rather than the complete conversion,
could be determined. The sampling times are denoted in the tables included in the text.

[000299] • The samples were analyzed by HPLC, using the methods outlined in Tables 20
and 21.

Example 6: Determination of the Effect of pH on enzyme activity and enantioselectivity

[000300] The effect of pH on the enzyme activity and enantioselectivity was studied by
performance of the standard assay in a range of different buffers:

0.1 M Citrate Phosphate pH 5
0.1 M Citrate Phosphate pH 6
0.1 M Sodium Phosphate pH 7
0.1 M Tris-HCl pH 8
0.1 M Tris-HCl pH 9

[000301] The samples were analyzed by non-chiral and chiral HPLC methods and examples
of the results are presented in Tables 5, 8 and 11 herein.

Example 7: Determination of the effect of temperature on enzyme activity and 
enantioselectivity 

[000302] The effect of temperature on the activity and enantioselectivity was investigated
by performing the standard assay at room temperature, 38°C and 55°C. The samples were
analyzed by non-chiral and chiral HPLC methods and examples of the results are given in
Tables 5, 8 and 11 herein.

Example 8: Determination of the Effect of solvents on enzyme activity and enantioselectivity 

[000303] The enzyme reactions were performed in the presence of cosolvents and as
biphasic systems, in order to investigate the effect of water-miscible and water-immiscible
solvents on the enzymes. In the presence of cosolvents, the reactions were run under
standard conditions, with substitution of the buffer with methanol or isopropanol. The final
concentrations of solvent in the reactions were 0, 10, 25 and 40% (v/v).
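
Assuming the substituted buffer is the 80 µl buffer portion of the 200 µl standard assay in Example 5 (an interpretation, not something stated outright), the cosolvent levels translate into pipetting volumes as sketched below. Under that assumption, 40% (v/v) is the highest level reachable, since it replaces the buffer portion entirely. Names are illustrative.

```python
REACTION_UL = 200.0  # total well volume in the standard assay
BUFFER_UL = 80.0     # buffer portion available for substitution (assumed)


def cosolvent_split(percent_v_v: float):
    """Return (solvent_ul, remaining_buffer_ul) for a target % (v/v) of cosolvent."""
    solvent_ul = REACTION_UL * percent_v_v / 100.0
    buffer_ul = BUFFER_UL - solvent_ul
    if percent_v_v < 0 or buffer_ul < 0:
        raise ValueError("cosolvent level cannot be met by replacing buffer alone")
    return solvent_ul, buffer_ul


for level in (0, 10, 25, 40):
    solvent_ul, buffer_ul = cosolvent_split(level)
    print(f"{level:>2}% (v/v): {solvent_ul:.0f} µl solvent + {buffer_ul:.0f} µl buffer")
```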

[000304] The biphasic reactions were also carried out under standard conditions, with a
layer of water-immiscible organic solvent forming the nonaqueous phase. The solvent was
added at the following levels: 0%, 10%, 40% and 70% (v/v) of the aqueous phase. The
samples from these reactions were evaporated by centrifugation under vacuum and
redissolved in a 50:50 mixture of methanol or acetonitrile and water. The samples were
analyzed by non-chiral and chiral HPLC methods.

Example 9: Determination of the Effect of process components on
enzyme activity and enantioselectivity

Activity

[000305] The effect of the process components on the activity of the enzymes was
established by addition of the individual components to the enzymatic reaction. These
components included the starting materials for the nitrile synthesis, aldehyde, cyanide and
ammonium, as well as triethylamine, which is added in catalytic amounts to the nitrile
synthesis reaction. The concentrations of the reactants were selected with possible process
conditions in mind and were adapted to the levels of reactants used in the enzyme assays. In
some cases, the solubility of the aldehydes and products was relatively low; in these cases,
the highest soluble concentration was added to the reactions as the highest level and 10% of this
level as the lower value.

[000306] The enzymatic reactions were carried out under standard conditions, with addition
of one or more of the following components: benzaldehyde, phenylglycine,
phenylacetaldehyde, phenyllactic acid, 2-chlorobenzaldehyde, 2-chloromandelic acid,
5-hydroxypentanal, (S)-2-amino-6-hydroxy hexanoic acid, 4-methylleucine, KCN,
triethylamine, NH4Cl. Control reactions were performed under standard conditions, with no
additive. The samples were analyzed by non-chiral HPLC.

Stability

[000307] The stability of the enzymes to process conditions was monitored by incubation of
the enzymes in the presence of the individual reaction components for predetermined time
periods, prior to assay of the enzyme activity under standard conditions. In these
experiments, the enzymes were incubated at a concentration of 1.2 mg protein/ml in the
presence of each of the following reaction components: methanol, benzaldehyde,
phenylglycine, phenylacetaldehyde, phenyllactic acid, 2-chlorobenzaldehyde,
2-chloromandelic acid, 5-hydroxypentanal, (S)-2-amino-6-hydroxy hexanoic acid, KCN,
NH4Cl.

Assay conditions: 

[000308] At 0, 2, 6 and 24 hours of incubation in the particular additive, 50 µl of the
enzyme solution was removed, 50 µl of a 50 mM substrate stock solution added and the
enzyme activity assayed under standard conditions. After substrate addition, the reactions
were sampled at the following times: Phenylglycinonitrile: 10 mins; Phenylacetaldehyde
cyanohydrin: 1 hour; 2-chloromandelonitrile: 2 hours. Control reactions were performed by
incubation of the enzyme in buffer only. The samples were analyzed using non-chiral HPLC
methods.
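
Mixing 50 µl of the 1.2 mg/ml enzyme incubation with 50 µl of 50 mM substrate gives the same 0.6 mg/ml protein and 25 mM substrate as the standard assay. A natural way to summarize the results, offered here only as an assumed analysis, is to express product formed after incubation with an additive as a percentage of the buffer-only control at the same time point. All product values below are invented for illustration.

```python
def mix_one_to_one(concentration: float) -> float:
    """Concentration after mixing equal volumes of a stock and a second solution."""
    return concentration / 2.0


def residual_activity_percent(treated_product_mM: float, control_product_mM: float) -> float:
    """Activity after incubation with an additive, relative to the buffer-only control."""
    if control_product_mM <= 0:
        raise ValueError("control shows no activity; ratio is undefined")
    return 100.0 * treated_product_mM / control_product_mM


print(mix_one_to_one(1.2))   # 0.6 mg protein/ml in the assay
print(mix_one_to_one(50.0))  # 25.0 mM substrate in the assay

# Hypothetical product concentrations (mM) at each incubation time (hours).
control = {0: 8.0, 2: 8.0, 6: 7.6, 24: 7.0}
with_additive = {0: 8.0, 2: 6.0, 6: 3.8, 24: 1.4}

for hours in (0, 2, 6, 24):
    pct = residual_activity_percent(with_additive[hours], control[hours])
    print(f"{hours:>2} h: {pct:.0f}% of control")
```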

Example 10: Confirmation of putative hit enzymes 

[000309] Following the preliminary characterization experiments, the enzymes which were
identified as putative hits were assayed under the optimal conditions determined, in order to
evaluate their performance, especially in terms of enantioselectivity, when higher conversions
were attained. The enzymes were assayed with 25 mM substrate, under the conditions of pH
and temperature noted in the tables included in the text. A standard concentration of 0.6
mg/ml protein was used for each of the enzymes, unless otherwise stated.

Example 11: Selected examples of chromatograms from enzyme reactions 

[000310] In this section, representative examples of chromatograms for each substrate and 
product combination will be shown, together with a discussion of some of the challenges 
encountered with the methods and how they were addressed. 

D-Phenylglycine

[000311] Non-chiral analysis showing the substrate peak eluting at 2.6 min and 3.2 min.
See Figures 8A-8E. The two peaks were present in all samples containing higher
concentrations of the nitrile; the second peak is thought to be a product associated with the
nitrile; it decreased with time and was no longer present once complete conversion to the
product had taken place. The chromatogram shown in Figure 8A is a blank control,
containing only nitrile and buffer; the samples were all diluted with water and solvent as
explained in section 1 above. This was repeated for all samples discussed below. An
enzymatic reaction sample is shown in the chromatogram in Figure 8B, with the product
eluting at 0.4 min.

[000312] Of note in these chromatograms is the small solvent front peak eluting at 0.3 min.
Further representation of this peak is given in the chromatogram shown in Figure 8C, in
which a negative control, consisting of cell lysate in buffer, was run. A very small peak
coeluted with the product at 0.4 min. In the initial phase of the project, this peak was
regarded as problematic, although the appropriate controls were run with each experiment
in order to maintain accuracy. In these experiments, the peak area resulting from the cell
lysate, although it was relatively small, was subtracted from the peak areas of the product in
the enzymatic reactions. Improvement of this analysis was obtained by further dilution of the
samples and the use of lower injection volumes on the HPLC. Following the implementation
of these improvements, interference by this peak was shown to be minimal, as shown in the
chromatogram illustrated in Fig. 6C.
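
The background correction described above amounts to subtracting the negative-control peak area from each sample's product peak area. A minimal sketch follows, with hypothetical peak areas and an added clamp at zero that the text does not mention.

```python
def corrected_product_area(sample_area: float, negative_control_area: float) -> float:
    """Product peak area after removing the coeluting cell-lysate contribution.

    The result is clamped at zero (an assumption) so that noise in a sample with
    no real product cannot produce a negative area.
    """
    if sample_area < 0 or negative_control_area < 0:
        raise ValueError("peak areas must be non-negative")
    return max(sample_area - negative_control_area, 0.0)


# Hypothetical peak areas at 0.4 min (arbitrary units).
lysate_only = 3.0
print(corrected_product_area(120.0, lysate_only))  # 117.0
print(corrected_product_area(2.0, lysate_only))    # 0.0
```

The correction is valid only if the negative control is diluted and injected in the same way as the samples it is subtracted from.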

[000313] The chiral analysis of phenylglycine is shown in the chromatogram in Fig. 6D, with
the L-enantiomer eluting at 6 min and the D-enantiomer at 11 min. Good resolution between
the two isomers was obtained. However, the column used was very sensitive and the
characteristics of the column appeared to change over time, resulting in changes in the elution
times of the acids. While this was easily detected by the use of the proper controls and
standards, a greater problem existed in the coelution of the nitrile peak with the D-enantiomer
(chromatogram shown in Fig. 6E). The cause of this coelution was unclear; however, it was
easily detected by the use of appropriate standards; in addition, the UV spectrum of the acid
was very distinctive, making the use of this tool effective in detecting the coelution. The
problem was also easily resolved by adjusting the methanol content in the mobile phase.

(R)-2-chloromandelic acid

[000314] The HPLC analysis of chloromandelic acid and chloromandelonitrile offered
many of the challenges associated with the analysis of the phenylglycine samples. From the
chromatogram shown in Fig. 7A, which contains only chloromandelonitrile in buffer, it is
evident that a peak eluted at the same time as the product in the chromatogram shown in Fig.
7B, which represents a chloromandelic acid standard. The contribution of the cell lysate to
this peak was found to be small; it would appear that the greatest contribution to this peak
was from the chloromandelonitrile, either from a breakdown product or a contaminant in the
nitrile preparation. The peak area remained constant throughout each experiment and, using
the appropriate controls, it was found that subtraction of the peak area from that of the
product yielded sufficient accuracy. Many attempts were made to change the HPLC
conditions so that the product peak eluted at a later time; however, these attempts were not
successful. The chromatogram shown in Fig. 7C illustrates the appearance of product and the
reduction of the substrate peaks.

[000315] The chiral analysis of chloromandelic acid was almost problem-free. The elution
of a small peak at the same time as the (S)-enantiomer presented some concern (the peak at
2.4 min in the chromatogram shown in Fig. 7D). However, once it was established that this peak
was present in all the samples at the same level, including the blank control, and that it had a
different UV spectrum to that of the chloromandelic acid peak, it was not regarded as a
problem. Consequently, it was subtracted from the peak eluting at 2.4 min in each sample.
The (R)-enantiomer eluted at 3 minutes.

[000316] The analysis of phenyllactic acid was initially plagued with the same problems
discussed for phenylglycine and 2-chloromandelic acid. However, in this case, adjustment of
the solvent concentration in the nonchiral HPLC method led to a shift in the retention time of
the acid, so that it no longer coeluted with the cell lysate peak. Following this, no problems
were encountered with either the nonchiral or chiral methods. Representative nonchiral
chromatograms of the product (1.9 min) and cyanohydrin substrate (3.7 min) are shown in
Fig. 8A, while the chiral analysis of the acid is shown in Fig. 8B, with the L-enantiomer
eluting at 2 min and the opposite enantiomer at 6 min.

L-2-methylphenylglycine

[000317] The analysis of methylphenylglycine was unproblematic, although the nonchiral
method did not provide baseline separation between a cell lysate peak and the product peak,
as shown in the chromatogram illustrated in Fig. 9A. The amino acid standard for this
method was provided in the final stages of the project, thus minimizing the time for method
development. In the chromatogram shown in Fig. 9A the amino acid elutes at 0.7 min and
the aminonitrile at 5.0 min. Sufficient separation between the two initial peaks was obtained
to allow the calculation of approximate conversion to product.

[000318] The chiral analysis of this compound provided good separation between the two
enantiomers, as shown in the chromatogram illustrated in Fig. 9B. The L-enantiomer elutes
at 5 min and the D-enantiomer at 8 min.

L-tert-leucine

[000319] For the nonchiral analysis of tert-leucine, the cell lysate presented the most serious
problem amongst the group of products for this project. This was compounded by the low
spectroscopic properties of the amino acid, leading to difficulty in differentiating the product
peak from the cell lysate. Good separation of the individual product enantiomers was
obtained by chiral analysis as shown in Fig. 10A. During the primary screen, a small peak
eluted at the same time as the L-amino acid standard in certain samples (see Fig. 10B) and
was thought to be the amino acid. However, further development of the method and the use
of the appropriate controls established that this peak was actually a cell lysate peak.

[000320] The aminonitrile eluted between the two tert-leucine peaks, as shown in Fig. 10C;
this chromatogram also shows the cell lysate peak at 4.8 min. The UV spectrum of the
nitrile was distinct from that of the amino acid, making it easier to differentiate from the acid
peaks.

L-hydroxynorleucine ((S)-2-amino-6-hydroxy hexanoic acid)

[000321] The chiral analysis of (S)-2-amino-6-hydroxy hexanoic acid was consistent and
reliable. By contrast, the nonchiral method presented many problems, primarily as a result of
non-separation between the nitrile and the acid peaks. Towards the latter half of the project, a
method was developed and used successfully for the confirmation of activities. Prior to this,
most of the analysis was performed using the chiral method; standard curves of the products
were run in order to quantify the reactions. A representative chromatogram of
(S)-2-amino-6-hydroxy hexanoic acid is shown in Figure 11A, with (S)-2-amino-6-hydroxy hexanoic acid
eluting at 6 min. The aminonitrile was not detected by this method.

[000322] Separation of the individual 2-amino-6-hydroxy hexanoic acid enantiomers is
shown in Fig. 11B. The L-enantiomer elutes first, at 2 min, followed by the D-enantiomer at
3 min. In Fig. 11C, an enzymatic sample is represented; the only area of slight concern is the
negative peak preceding the elution of the L-enantiomer. However, it did not appear to
interfere significantly with the elution of this enantiomer; method development did not
eliminate the negative peak.

4-methyl-D-leucine and 4-methyl-L-leucine

[000323] For the detection of 4-methylleucine, the chiral HPLC method again proved more 
reliable. The combination of low activities, together with the low sensitivity of the method to 
the compound led to difficulties in detection using nonchiral HPLC. A 2.5 mJVl standard of 
the amino acid is shown in Fig. 12A, with a peak height of approximately 40 mAU; this was 
substantially lower than those detected for the aromatic compounds. Chromatogram in Fig. 
12B shows an enzymatic sample, in which conversion was detected using the chiral HPLC 
method; while it is not clear, it would appear that the 4-methylleucine peak elutes at 2.7 min 
and is extremely low in both peak height and area. This peak did not appear in samples 
which were negative by chiral HPLC analysis. 

[000324] The chiral analysis of 4-methyl-L-leucine and 4-methyl-D-leucine did not present 
any problems. The L-enantiomer eluted at 5 min and the D-enantiomer at 7 min, although 
some peak shift did occur, as a result of the sensitivity to the column, described in section (i) 
for phenylglycine. In chromatograms shown in Figs. 14C-14D, the separation of these amino 
acids is shown; the first sample represents an enzyme which produced both enantiomers and 
in the second sample, the enzyme preferentially hydrolyzed the L-enantiomer, with a small 
amount of D-amino acid forming. 
(S)-cyclohexylmandelic acid 

[000325] Chromatograms of the standards for cyclohexylmandelic acid (Fig. 13 A) and the 
corresponding nitrile (Fig. 13B) are shown. The acid eluted at 1.3 min, while the 
cyanohydrin was observed at 2.5 min. The peak eluting at 2.1 min is thought to be the 
cyclohexylphenylketone, as shown by the elution of a ketone standard at this point. 

Example 12: An Enzyme Library Approach to Biocatalysis: Development of a Nitrilase 
Platform for Enantioselective Production of Carboxylic Acid Derivatives 

[000326] Biocatalytic processes can offer unique advantages in transformations that are 
challenging to accomplish through conventional chemical methods (Wong, C.-H.; 
Whitesides, G.M. Enzymes in Synthetic Organic Chemistry; Pergamon, New York, 1994; 
Drauz, K.; Waldmann, H., Roberts, S.M. Eds. Enzyme Catalysis in Organic Synthesis; VCH: 
Weinheim, Germany, 2nd ed., 2002). Nitrilases (EC 3.5.5.1) promote the mild hydrolytic 
conversion of organonitriles directly to the corresponding carboxylic acids (Kobayashi, M.; 
Shimizu, S. FEMS Microbiol. Lett. 1994, 120, 217; Bunch, A.W. In Biotechnology; Rehm, 
H.-J.; Reed, G.; Puhler, A.; Stadler, P., Eds.; Wiley-VCH: Weinheim, Germany, Vol. 8a, 
Chapter 6, pp 277-324; Wieser, M.; Nagasawa, T. In Stereoselective Biocatalysis; Patel, 
R.N., Ed.; Marcel Dekker: New York, 2000, Chapter 17, pp 461-486). Fewer than fifteen 
microbially-derived nitrilases have been characterized and reported to date (Harper, D.B. Int. 
J. Biochem. 1985, 17, 677; Levy-Schil, S.; Soubrier, F.; Crutz-Le Coq, A.M.; Faucher, D.; 
Crouzet, J.; Petre, D. Gene 1995, 161, 15; Yu, F. 1999, US Patent 5872000; Ress-Loschke, 
M.; Friedrich, T.; Hauer, B.; Mattes, R.; Engels, D. PCT Appl. WO 00/23577, April 2000). 
Several nitrilases previously have been explored for the preparation of single-enantiomer 
carboxylic acids, although little progress has been made in the development of nitrilases as 
viable synthetic tools. This application describes the discovery of a large and diverse set of 
nitrilases and demonstrates the utility of this nitrilase library for identifying enzymes 
that catalyze efficient enantioselective production of valuable hydroxy carboxylic acid 
derivatives. 

[000327] In an effort to access the most diversified range of enzymes that can be found 
in Nature, we create large genomic libraries by extracting DNA directly from environmental 
samples that have been collected from varying global habitats. (For a description of these 
methods, see: Short, J.M. Nature Biotech. 1997, 15, 1322; Handelsman, J.; Rondon, M.J.; 
Brady, S.F.; Clardy, J.; Goodman, R.M. Chem. Biol 1998, 5, R245; Henne, A.; Daniel, R.; 
Schmitz, R.A.; Gottschalk, G. Appl Environ. Microbiol. 1999, 65, 3901.). We have 
established a variety of methods for identifying novel activities through screening mixed 
populations of uncultured DNA. (Robertson, D.E.; Mathur, E.J.; Swanson, R.V.; Marrs, 
B.L.; Short, J.M. SIM News 1996, 46, 3; Short, J.M. US Patent 5,958,672, 1999; Short J.M. 
US Patent 6,030,779, 2000.) Through this approach, nearly 200 new nitrilases have been 
discovered and characterized. (For a concise description of the studies, see Materials and 
Methods section below.) All nitrilases were defined as unique at the sequence level and were 
shown to possess the conserved catalytic triad Glu-Lys-Cys which is characteristic for this 
enzyme class. (Pace, H.; Brenner, C. Genome Biology 2001, 2, 0001.1-0001.9.) Each 
nitrilase in our library was overexpressed and stored as a lyophilized cell lysate in order to 
facilitate rapid evaluation of the library for particular biocatalytic functions. 
[000328] The initial investigations focused upon the efficacy of nitrilases for production of 
α-hydroxy acids 2 formed through hydrolysis of cyanohydrins 1. Cyanohydrins are well 
documented to racemize readily under basic conditions through reversible loss of HCN. 
((a) Inagaki, M.; Hiratake, J.; Nishioka, T.; Oda, J. J. Org. Chem. 1992, 57, 5643. (b) van 
Eikeren, P. US Patent 5,241,087, 1993.) Thus, a dynamic kinetic resolution process is 
possible whereby an enzyme selectively hydrolyzes only one enantiomer of 1, affording 2 in 
100% theoretical yield and with high levels of enantiomeric purity. 
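
The contrast between a classical kinetic resolution (limited to 50% yield from a racemate) and the dynamic process described above can be illustrated numerically. The following is a minimal sketch and not part of the original disclosure: the function name and the rate constants are hypothetical, and the model simply assumes first-order hydrolysis of each cyanohydrin enantiomer together with first-order interconversion of the two enantiomers.

```python
def simulate_resolution(k_fast, k_slow, k_rac, t_end=50.0, n_steps=50_000):
    """Euler integration of a two-enantiomer resolution model.

    k_fast, k_slow: assumed hydrolysis rate constants of the preferred and
                    non-preferred cyanohydrin enantiomers
    k_rac:          assumed racemization rate constant (0 = no racemization)
    Returns (fractional yield of acid, ee of acid as a fraction).
    """
    dt = t_end / n_steps
    sub_fast, sub_slow = 0.5, 0.5      # racemic starting material
    acid_fast, acid_slow = 0.0, 0.0
    for _ in range(n_steps):
        hyd_fast = k_fast * sub_fast * dt
        hyd_slow = k_slow * sub_slow * dt
        # net flow from the slow-reacting to the fast-reacting enantiomer
        rac = k_rac * (sub_slow - sub_fast) * dt
        sub_fast += rac - hyd_fast
        sub_slow += -rac - hyd_slow
        acid_fast += hyd_fast
        acid_slow += hyd_slow
    total = acid_fast + acid_slow
    return total, (acid_fast - acid_slow) / total


# Illustrative values only: a perfectly selective enzyme, without and with
# substrate racemization.
print(simulate_resolution(1.0, 0.0, 0.0))    # classical kinetic resolution
print(simulate_resolution(1.0, 0.0, 10.0))   # dynamic kinetic resolution
```

With no racemization the acid yield levels off near one half, whereas rapid racemization keeps feeding the preferred enantiomer until essentially all substrate is converted.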

[000329] One important application of this type involves commercial production of (R)- 
mandelic acid from mandelonitrile. (Ress-Loschke, M.; Friedrich, T.; Hauer, B.; Mattes, R.; 
Engels, D. PCT Appl. WO 00/23577, April 2000; Yamamoto, K.; Oishi, K.; Fujimatsu, I.; 
Komatsu, K. Appl. Environ. Microbiol. 1991, 57, 3028; Endo, T.; Tamura, K. US Patent 
5,296,373, March 1994.) Mandelic acid and derivatives find broad use as intermediates and 
resolving agents for production of many pharmaceutical and agricultural products. (Coppola, 
G.M.; Schuster, H.F. Chiral α-Hydroxy Acids in Enantioselective Synthesis; Wiley-VCH: 
Weinheim, Germany: 1997.) However, the few known nitrilases derived from cultured 
organisms have not been found useful for efficient and selective hydrolysis of analogous 
substrates. 
[000330] The nitrilase library was screened for activity and enantioselectivity in the 
hydrolysis of mandelonitrile (3a, Ar = phenyl) to mandelic acid. Preliminary results revealed 
that 27 enzymes afforded mandelic acid in >90% ee. 

[Scheme: cyanohydrin 3, ArCH(OH)CN, is converted by nitrilase SEQ ID NOS:385, 386 to the 
α-hydroxy acid (R)-4, ArCH(OH)CO2H.] 

One enzyme, SEQ ID NOS:385, 386, 
was studied in greater detail and was found to be very active for hydrolysis of mandelonitrile. 
Under standard conditions using 25 mM 3a and 0.12 mg/mL enzyme in 10% MeOH (v/v) 0.1 
M phosphate buffer at 37°C and pH 8, (R)-mandelic acid was formed quantitatively within 
10 min and with 98% ee. To confirm synthetic utility, the reaction was performed using 1.0 g 
3a (50 mM) and 9 mg nitrilase (0.06 mg/mL nitrilase I); after 3 h (R)-mandelic acid was 
isolated in high yield (0.93 g, 86%) and again with 98% ee. 

(a) Reactions were conducted under standard conditions (see text). Reaction time for 
complete conversion to 4 was 1-3 h. Entries 8-9 were conducted at pH 9 and 5 mM substrate 
concentration. (b) Specific activities were measured at 5 min transformation timepoints and 
are expressed as μmol mg⁻¹ min⁻¹. (c) TOF = turnover frequency, mol product/mol 
catalyst/sec. (d) Enantioselectivities were determined by chiral HPLC analysis. Hydroxy acids 
were isolated and absolute configurations were determined to be (R) in all cases. 

[000331] The substrate scope of SEQ ID NOS:385, 386 was next explored. As shown 
in Table 13, a broad range of mandelic acid derivatives as well as aromatic and 
heteroaromatic analogues (4) may be prepared through this method. SEQ ID NOS:385, 386 
tolerates aromatic ring substituents in the ortho-, meta-, and para-positions of mandelonitrile 
derivatives, and products of type 4 were produced with high enantioselectivities. Other larger 
aromatic groups such as 1-naphthyl and 2-naphthyl also are accommodated within the active 
site, again affording the acids 4 with high selectivity (Table 13, entries 8-9). Finally, 
3-pyridyl and 3-thienyl analogues of mandelic acid were prepared readily using this process 
(Table 13, entries 10-11). This is the first reported demonstration of a nitrilase that affords a 
range of mandelic acid derivatives and heteroaromatic analogues of type 4. High activity on 
the more sterically encumbered ortho-substituted and 1-naphthyl derivatives is particularly 
noteworthy. 
[000332] We next examined the preparation of aryllactic acid derivatives 6 through 
hydrolysis of the corresponding cyanohydrins 5. Phenyllactic acid and derivatives serve as 
versatile building blocks for the preparation of numerous biologically active compounds. 
(Coppola, G.M.; Schuster, H.F. Chiral α-Hydroxy Acids in Enantioselective Synthesis; 
Wiley-VCH: Weinheim, Germany: 1997.) Upon screening our nitrilase library against the 
parent cyanohydrin 5a (Ar = phenyl), we found several enzymes that provided 6a with high 
enantiomeric excess. One enzyme, SEQ ID NOS:103, 104, was further characterized. After 
optimization, SEQ ID NOS:103, 104 was shown to provide (S)-phenyllactic acid (6a) with 
complete conversion (50 mM) and very high enantioselectivity (98% ee) over 6 h. The 
highest enantioselectivity previously reported for biocatalytic conversion of 5 to 6 was 75% 
ee achieved through a whole cell transformation using a Pseudomonas strain. (Hashimoto, 
Y.; Kobayashi, E.; Endo, T.; Nishiyama, M.; Horinouchi, S. Biosci. Biotech. Biochem. 1996, 
60, 1279.) 

Table 7. Nitrilase II-catalyzed production of aryllactic acid derivatives and analogues 6 (a) 

Entry   Ar in 6        Spec. Act. (b)   TOF (c)   % ee (d) 
1       C6H5           25               16        99 
2       2-Me-C6H5      160              100       95 
3       2-Br-C6H5      121              76        95 
4       2-F-C6H5       155              97        91 
5       3-Me-C6H5      21               13        95 
6       3-F-C6H5       22               14        99 
7       1-naphthyl     64               40        96 
8       2-pyridyl      10.5             6.6       99 
9       3-pyridyl      11.6             7.2       97 
10      2-thienyl      3.4              2.1       96 
11      3-thienyl      2.3              1.4       97 



(a) Reaction conditions as in Table 13, except 0.016 mg/mL nitrilase was used. Full 
conversion to 6 was observed within 6 h. (b)-(d) See Table 13. The absolute configuration 
was determined to be (S) for phenyllactic acid and entries 2-11 were assigned (S) based upon 
identical chiral HPLC peak elution order. 
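
The relationship between the specific activity and TOF columns can be sketched as a unit conversion. This is an illustrative calculation only: the function name is hypothetical, and the enzyme subunit mass of 37.5 kDa is an assumption back-calculated from the ratio of the two columns in Table 7; it is not stated in the text.

```python
def turnover_frequency(specific_activity, enzyme_mass_kda):
    """Convert specific activity (umol product per mg enzyme per min)
    into a turnover frequency (mol product per mol enzyme per second).

    One mg of an enzyme of M kDa contains 1/M umol of enzyme, so the
    number of turnovers per minute is specific_activity * M.
    """
    turnovers_per_min = specific_activity * enzyme_mass_kda
    return turnovers_per_min / 60.0


ASSUMED_MASS_KDA = 37.5  # assumption, not given in the source

# Specific activities taken from Table 7 (entries 1, 2 and 7)
for aryl, activity in [("C6H5", 25), ("2-Me-C6H5", 160), ("1-naphthyl", 64)]:
    tof = turnover_frequency(activity, ASSUMED_MASS_KDA)
    print(f"{aryl}: TOF = {tof:.1f} per second")
```

Under this assumed mass the computed values agree with the tabulated TOF entries to within rounding, which suggests, but does not prove, that the table was generated by a conversion of this form.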

[000333] Ortho and meta substituents appear to be tolerated well by nitrilase II, with 
ortho substituted derivatives surprisingly being converted with higher rates relative to the 
parent substrate 5a. Novel heteroaromatic derivatives, such as 2-pyridyl-, 3-pyridyl-, 
2-thienyl- and 3-thienyllactic acids, were prepared with high conversions and 
enantioselectivities (entries 8-11). Unexpectedly, para substituents greatly lowered the rates 
of these reactions, with full conversion taking over two weeks under these conditions. 

[000334] The final transformation that we examined was desymmetrization of the 
readily available prochiral substrate 3-hydroxyglutarylnitrile (7) (Johnson, F.; Panella, J.P.; 
Carlson, A.A. J. Org. Chem. 1962, 27, 2241) to afford hydroxy acid (R)-8 which, once 
esterified to (R)-9, is an intermediate used in the manufacture of the cholesterol-lowering 
drug LIPITOR™. Previously reported attempts to use enzymes for this process were 
unsuccessful and 8 was produced with low selectivity (highest: 22% ee) and the undesired 
(S)-configuration. (Crosby, J.A.; Parratt, J.S.; Turner, N.J. Tetrahedron: Asymmetry 1992, 3, 
1547; Beard, T.; Cohen, M.A.; Parratt, J.S.; Turner, N.J. Tetrahedron: Asymmetry 1993, 4, 
1085; Kakeya, H.; Sakai, N.; Sano, A.; Yokoyama, M.; Sugai, T.; Ohta, H. Chem. Lett. 1991, 
1823.) 
[000335] The nitrilase library was screened and unique enzymes were discovered and 
isolated that provided the required product (R)-8 with high conversion (>95%) and >90% ee. 
Using one of the (R)-specific nitrilases, this process was operated on a 1.0 g scale (240 mM 7, 
30 mg enzyme, 22°C, pH 7) and after 22 h, (R)-8 was isolated in 98% yield and 95% ee. 
Interestingly, the same screening program also identified nitrilases that afford the opposite 
enantiomer (S)-8 with 90-98% ee. Thus, the extensive screen of biodiversity has uncovered 
enzymes that provide ready access to either enantiomer of the intermediate 8 with high 
enantioselectivities. Our discovery of the first enzymes that furnish (R)-8 underscores the 
advantage of having access to a large and diverse library of nitrilases. 

[000336] By plumbing our environmental genomic libraries created from uncultured 
DNA, we have discovered a large array of novel nitrilases. This study has revealed specific 
nitrilases that furnish mandelic and aryllactic acid derivatives, as well as either enantiomer of 
4-cyano-3-hydroxybutyric acid in high yield and enantiomeric excess. 

Procedures and Analytical Data: 

[000337] Hydroxyglutarylnitrile was purchased from TCI America and used as 
received. Amino acids used for the preparation of aryl lactic acid standards were purchased 
from PepTech (Cambridge, MA). (R)-3-hydroxy-4-cyanobutyric acid was obtained from 
Gateway Chemical Technology (St. Louis, MO). Both (R)- and (S)-mandelic acid and (R)- 
and (S)-phenyl lactic acid standards were purchased from Sigma Aldrich. All other reagents 
were purchased from Sigma Aldrich and utilized without further purification. Silica Gel, 
70-230 mesh, 60 Å, purchased from Aldrich, was used for chromatographic purifications. All 1H 
NMRs and 13C NMRs were run on Bruker model AM-500 machines, set at room temperature, 
500 MHz and 125 MHz respectively for 1H and 13C. Mass analyses at unit mass resolution 
were carried out by flow injection analysis (FIA) using a Perkin-Elmer Sciex API-4000 
TURBOION™ Spray LC/MS/MS system. The LC flow was provided by Shimadzu 
LC-10ADvp pumps, with 0.05% acetic acid and MeOH. Injections were accomplished via a 
Valco injector valve. The HPLC analysis was done on an Agilent 1100 HPLC with Astec's 
Chirobiotic R column (100 x 4.6 mm, cat no. 13022 or 150 x 4.6 mm, cat no. 13023) or 
Daicel's Chiralcel OD column (50 x 4.6 mm, cat no. 14022) and the DAD detector set at 210, 
220, 230, and 250 nm. For specific rotations, a Perkin Elmer Model 341 Polarimeter was 
used, set at 589 nm, Na lamp, at room temperature, with a 100 mm path length cell. 
Concentrations for specific rotation are reported in grams per 100 mL of solvent. 
Microbiology techniques were executed in accordance with published protocols. (Sambrook, J.; 
Fritsch, E.F.; Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual (2nd ed.), Cold 
Spring Harbor Laboratory Press, Plainview NY.) Glycolic acid products were isolated and 
absolute configurations were determined to be (R) in all cases by comparison with literature 
optical rotation data on configurationally defined compounds except for (-)-3-pyridylglycolic 
acid, which to our knowledge is not known as a single enantiomer. (For mandelic, 
2-chloromandelic, 2-methyl mandelic, 3-chloromandelic, 3-bromomandelic and 
4-fluoromandelic acid see Hoover, J.R.E.; Dunn, G.L.; Jakas, D.R.; Lam, L.L.; Taggart, J.J.; 
Guarini, J.R.; Phillips, L. J. Med. Chem. 1974, 17(1), 34-41; for 2-bromo mandelic acid see 
Collet, A.; Jacques, J. Bull. Soc. Chim. Fr. 1973, 12, 3330-3331; for 1- and 
2-naphthylglycolic acid see Takahashi, I.; Aoyagi, Y.; Nakamura, I.; Kitagawa, A.; Matsumoto, 
K.; Kitajima, H.; Isa, K.; Odashima, K.; Koga, K. Heterocycles 1999, 51(6), 1371-88; for 
3-thienylglycolic acid see Gronowitz, S. Ark. Kemi 1957, 11, 519-525.) 
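
Because the polarimeter settings above fix the path length (100 mm) and the concentration convention (grams per 100 mL), a specific rotation follows from an observed rotation by the usual relation [α] = 100·α_obs / (l·c), with l in decimetres. The sketch below is illustrative only; the function name is hypothetical and the observed reading used in the example is an assumed value, not a measurement reported in this document.

```python
def specific_rotation(observed_deg, path_length_mm, conc_g_per_100ml):
    """Specific rotation from an observed rotation.

    observed_deg:      polarimeter reading in degrees
    path_length_mm:    cell path length in mm (converted to dm below)
    conc_g_per_100ml:  concentration in g per 100 mL of solvent
    """
    path_dm = path_length_mm / 100.0
    return 100.0 * observed_deg / (path_dm * conc_g_per_100ml)


# Hypothetical reading: -0.673 degrees in a 100 mm cell at c = 0.5
print(round(specific_rotation(-0.673, 100, 0.5), 1))
```

The sign of the result carries through from the reading, so a levorotatory sample gives a negative specific rotation.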

[000338] For the aryl lactic acid products, absolute configuration was established to be 
(S) for phenyl lactic acid by comparison with literature optical rotation and for all other 
phenyl lactic acid products, absolute configurations were predicted based upon elution order 
using chiral HPLC. Absolute configuration for 3-hydroxy-4-cyano-butanoic acid was 
established by derivatization to (R)-(-)-methyl (3-O-[benzoyl]-4-cyano)-butanoate and 
comparison to literature optical rotation data on the configurationally defined compound. 
(Beard, T.; Cohen, M.A.; Parratt, J.S.; Turner, N.J. Tetrahedron: Asymmetry 1993, 4(6), 
1085-1104.) 

Nitrilase Discovery and Characterization Methods: 
1. Nitrilase Selection. 

[000339] An Escherichia coli screening host strain, SEL700, was optimized for nitrilase 
selections on a nitrile substrate. An Abs600nm = 1 resuspension of SEL700 screening host in 
10 mM MgSO4 was infected with kanamycin-resistant environmental DNA library for 45 
minutes at 37°C, such that complete screening coverage of the library was achieved. Infected 
cells, now denoted by kanamycin resistance, were plated on kanamycin LB plates and 
allowed to grow overnight at 30°C. Titer plates were also made to determine infection 
efficiency. Cells were pooled, washed, and resuspended the next morning with 10 mM 
MgSO4. Transformed clones were inoculated into M9 media (without nitrogen) with 10 mM 
of nitrile substrate. M9 media consisted of 1X M9 salts (NH4Cl omitted), 0.1 mM CaCl2, 1 
mM MgSO4, 0.2 % glucose, and approximately 10 mM of a nitrile selection substrate. The 
selection cultures were then incubated at 30°C, shaking at 200 rpm, for up to five weeks. 
Positive nitrilase cultures were identified by growth, due to a positive clone's ability to 
hydrolyze the nitrile substrate. Positive clones were isolated by streaking out a selection culture 
with growth and subsequent secondary culturing of isolated colonies in the same defined 
media. The DNA from any positive secondary cultures exhibiting re-growth was then 
isolated and sequenced to confirm discovery of a nitrilase gene and to establish the unique 
nature of that gene. 

2. Nitrilase Biopanning. 

[000340] Traditional filter lift hybridization screening protocols are limited to libraries 
with approximately 10^6 to 10^7 members. Attempting to screen one library would require 
approximately 5,000 filter lifts. Therefore, solution phase and other biopanning formats have 
been developed for ultra high throughput sequence based screening permitting rapid 
screening of up to 10^8 member environmental libraries. In the solution format, the DNA 
from a large number of library clones is mixed with tagged molecules of interest under 
conditions which promote hybridization. The tagged clones and hybridized DNA are then 
removed from solution and washed at some level of stringency to remove clones which do 
not have sequence identity with the probe. The hybridized DNA is then eluted and recovered. 
Clones of interest are sequenced and cloned to provide enzyme activities of interest. This 
method has been demonstrated to achieve up to 1,000-fold enrichment per round for 
sequences of interest. 
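
The practical meaning of a 1,000-fold enrichment per round can be sketched with a simple odds calculation. This is an illustrative model only, not a description of the actual biopanning protocol: the function name, the assumption that each round multiplies the odds of a target clone by a constant factor, and the starting and target fractions are all hypothetical.

```python
def rounds_to_enrich(start_fraction, fold_per_round, target_fraction):
    """Number of enrichment rounds needed for target clones to reach a
    given fraction of the pool, assuming each round multiplies the odds
    (targets : non-targets) by a constant factor.

    Fractions must lie strictly between 0 and 1.
    """
    odds = start_fraction / (1.0 - start_fraction)
    target_odds = target_fraction / (1.0 - target_fraction)
    rounds = 0
    while odds < target_odds:
        odds *= fold_per_round
        rounds += 1
    return rounds


# Hypothetical case: one target clone in a 10^8 member library,
# 1,000-fold enrichment per round, goal of at least half the pool.
print(rounds_to_enrich(1e-8, 1000, 0.5))
```

Working with odds rather than raw fractions keeps the enriched fraction below one however many rounds are applied.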

3. High Throughput Nitrilase Activity Assay. 

[000341] Activity assays were conducted using 25 mM (~3 mg/mL) substrate, 0.1 
mg/mL nitrilase in 0.25 mL of assay solution. Assay solutions consisted of 0-10% (v/v) 
MeOH in 0.1 M sodium phosphate buffer solution at pH 7 to 9 and temperatures of 37°C or 
22°C. Specific activities were measured at the 5 min transformation time point, unless otherwise 
noted, and are expressed in units of μmol mg⁻¹ min⁻¹. Enantiomeric excess and conversion rates 
were determined by high throughput HPLC analysis comparing enzyme product 
concentration to standard curves of racemic acid products. Analytical conditions for the 
products are tabulated below. 
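
Enantiomeric excess follows directly from the two enantiomer peak areas in a chiral chromatogram. The sketch below assumes that both enantiomers have the same detector response, which holds for enantiomers under achiral UV detection; the function name and the peak areas in the example are hypothetical.

```python
def enantiomeric_excess(area_r, area_s):
    """Percent ee from integrated peak areas of the (R) and (S) enantiomers.

    A positive result means (R) is in excess, a negative result means (S).
    """
    total = area_r + area_s
    if total <= 0:
        raise ValueError("peak areas must sum to a positive value")
    return 100.0 * (area_r - area_s) / total


# Hypothetical areas: a 99:1 ratio of (R) to (S) corresponds to 98% ee
print(enantiomeric_excess(99.0, 1.0))
```

A racemic standard, with equal areas for the two peaks, gives 0% ee and is a convenient check on the integration settings.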
Analytical Methods: 

No.    Acid Product                              Column                          Liquid Chromatography Method                            Retention Times of enantiomers (min) 
1.1    mandelic acid                             Chirobiotic R, 100 x 4.6 mm     20% [0.5% AcOH], 80% CH3CN, 1 ml/min                    2.4 (S); 2.9 (R) 
1.2    2-Cl-mandelic acid                        Chirobiotic R, 100 x 4.6 mm     20% [0.5% AcOH], 80% CH3CN, 1 ml/min                    2.3 (S); 2.9 (R) 
1.3    2-Br-mandelic acid                        Chirobiotic R, 100 x 4.6 mm     20% [0.5% AcOH], 80% CH3CN, 1 ml/min                    2.8; 4.0 
1.4    2-CH3-mandelic acid                       Chirobiotic R, 100 x 4.6 mm     20% [0.5% AcOH], 80% CH3CN, 1 ml/min                    3.1; 3.8 
1.5    3-Cl-mandelic                             Chirobiotic R, 100 x 4.6 mm     10% [0.5% AcOH], 90% CH3CN, 1 ml/min                    3.1; 3.8 
1.6    3-Br-mandelic                             Chirobiotic R, 100 x 4.6 mm     10% [0.5% AcOH], 90% CH3CN, 1 ml/min                    3.3; 3.9 
1.7    4-F-mandelic                              Chirobiotic R, 150 x 4.6 mm     20% [0.5% AcOH], 80% CH3CN, 1 ml/min                    3.7; 4.8 
1.8    1-naphthylglycolic acid                   Chirobiotic R, 100 x 4.6 mm     4% [0.5% AcOH], 96% CH3CN, 1 ml/min                     3.1; 3.7 
1.9    2-naphthylglycolic acid                   Chirobiotic R, 100 x 4.6 mm     4% [0.5% AcOH], 96% CH3CN, 1 ml/min                     3.7; 4.7 
1.10   3-pyridylglycolic acid                    Chirobiotic R, 100 x 4.6 mm     5% [0.5% AcOH], 65% H2O, 30% CH3CN, 2 ml/min            4.4; 5.5 
1.11   3-thienylglycolic acid                    Chirobiotic R, 100 x 4.6 mm     20% [0.5% AcOH], 80% CH3CN, 2 ml/min                    1.4; 2.5 
2.1    phenyl lactic acid                        Chirobiotic R, 150 x 4.6 mm     20% [0.5% AcOH], 80% CH3CN, 1 ml/min                    2.8 (S); 4.0 (R) 
2.2    2-methylphenyl lactic acid                Chirobiotic R, 150 x 4.6 mm     20% [0.5% AcOH], 80% CH3CN, 1 ml/min                    2.5; 2.8 
2.3    2-bromophenyl lactic acid                 Chirobiotic R, 150 x 4.6 mm     20% [0.5% AcOH], 80% CH3CN, 1 ml/min                    2.8; 3.2 
2.4    2-fluorophenyl lactic acid                Chirobiotic R, 150 x 4.6 mm     20% [0.5% AcOH], 80% CH3CN, 1 ml/min                    2.6; 2.9 
2.5    3-methylphenyl lactic acid                Chirobiotic R, 150 x 4.6 mm     20% [0.5% AcOH], 80% CH3CN, 1 ml/min                    2.4; 3.2 
2.6    3-fluorophenyl lactic acid                Chirobiotic R, 150 x 4.6 mm     20% [0.5% AcOH], 80% CH3CN, 1 ml/min                    2.8; 3.6 
2.7    1-naphthyllactic acid                     Chirobiotic R, 150 x 4.6 mm     20% [0.5% AcOH], 80% CH3CN, 1 ml/min                    2.7; 3.1 
2.8    2-pyridyllactic acid                      Chirobiotic R, 150 x 4.6 mm     20% [0.5% AcOH], 80% CH3CN, 1 ml/min                    2.5; 2.9 
2.9    3-pyridyllactic acid                      Chirobiotic R, 150 x 4.6 mm     20% [0.5% AcOH], 80% CH3CN, 1 ml/min                    2.9; 3.6 
2.10   2-thienyllactic acid                      Chirobiotic R, 150 x 4.6 mm     20% [0.5% AcOH], 80% CH3CN, 1 ml/min                    3.6; 4.6 
2.11   3-thienyllactic acid                      Chirobiotic R, 150 x 4.6 mm     20% [0.5% AcOH], 80% CH3CN, 1 ml/min                    3.5; 4.6 
       Methyl (3-O-[benzoyl]-4-cyano)-butanoate  Daicel OD, 50 x 4.6 mm          5% isopropanol, 95% hexane, 1 ml/min                    4.5 (R); 5.4 (S) 



Cyanohydrin (Substrate) Synthesis: 
[000342] Mandelonitrile Synthesis Method A: Acetone cyanohydrin (685 μL, 7.5 mmol), 
aldehyde (5 mmol), and catalytic DIEA (13 μL, 0.075 mmol) were mixed at 0 °C. The 
reactions were stirred on ice for 45 minutes. To drive the equilibrium toward the product, 
acetone was removed in vacuo. Subsequently, crude reactions were acidified with H2SO4 (3 
μL) and stored at -20°C. TLC was used to monitor reaction progress (3:1 hexane/ethyl acetate 
(EtOAc)). 

[000343] Mandelonitrile Synthesis Method B: To a solution of KCN (358 mg, 5.5 mmol) in 
MeOH (1 mL) at 0°C was added aldehyde (5 mmol) and acetic acid (315 μL, 5.5 mmol). 
After stirring for one hour on ice, MeOH was removed in vacuo, and the crude mixture was 
partitioned using EtOAc and H2O. The organic fraction was retained and concentrated in 
vacuo. TLC analysis was used to monitor reaction progress (3:1 hexanes/EtOAc). 

[000344] Aryl Acetaldehyde Cyanohydrin: Arylacetic acid (50 mmol) was dissolved in 50 
ml anhydrous tetrahydrofuran (THF) in a two-neck 500 ml round-bottom flask under N2 (g) 
atmosphere. To this solution cooled to 0 °C, under vigorous mixing, was added slowly 105 
mmol of thexylchloroborane-dimethyl sulfide (2.55 M in methylene chloride). The reaction 
was allowed to proceed overnight. Excess acetic acid (10 ml) was added to quench and 
acidify the reaction, followed by the addition of 10 ml water. After stirring at room temperature 
for 1 hour, solvent was removed in vacuo and the residue was dissolved in 100 ml water and 
extracted with 200 ml EtOAc. The EtOAc layer was dried over sodium sulfate, filtered and 
then concentrated in vacuo. Subsequently, 60 mmol of KCN, followed by 100 ml methanol, 
was added to the residue. The solution was then cooled to 0 °C and acetic acid (60 mmol) 
added. The reaction was stirred for 1-2 hours after all KCN dissolved. Solvents were removed 
in vacuo and the residue was dissolved in 100 ml water and 200 ml EtOAc. The aqueous layer 
was extracted with EtOAc one more time. Combined EtOAc extracts were washed with 
saturated brine and dried over sodium sulfate, filtered and then concentrated in vacuo to 
obtain crude cyanohydrin product. The cyanohydrin was purified by silica-gel column 
(hexane/EtOAc), as necessary. 

[000345] 2-chloromandelonitrile: 1H NMR (CDCl3, 500 MHz) δ 7.69 (m, 1H), 7.41 (m, 
1H), 7.36 (m, 2H), 5.84 (s, 1H), 3.07 (br, 1H). 13C NMR (CDCl3, 125 MHz) δ 132.89, 
132.73, 131.22, 130.19, 128.48, 127.84, 118.24, 60.87. MS calc'd for [C8H6ClNO] 167.01, 
found 167.9 (LC-MS +). 

[000346] 2-bromomandelonitrile: 1H NMR (CDCl3, 500 MHz) δ 7.72 (d, 1H, J = 6.58), 7.62 
(d, 1H, J = 8.35), 7.43 (t, 1H, J = 8.42), 7.30 (t, 1H, J = 7.00), 5.85 (s, 1H). 13C NMR (CDCl3, 
125 MHz) δ 134.550, 133.584, 131.564, 128.819, 128.535, 122.565, 118.153, 63.379. 
[000347] 2-methylmandelonitrile: 1H NMR (CDCl3, 500 MHz) δ 7.60 (d, 1H, J = 7.4), 
7.23-7.35 (m, 3H), 5.66 (s, 1H), 2.44 (s, 3H). 13C NMR (CDCl3, 298 K, 125 MHz) δ 
136.425, 133.415, 131.450, 130.147, 127.204, 126.894, 118.952, 18.916. MS calc'd for 
[C9H9NO] 147.07, found 147.2 (ESI +). 
[000348] 3-chloromandelonitrile: 1H NMR (CDCl3, 500 MHz) δ 7.55 (s, 1H), 7.43-7.37 (m, 
3H), 5.54 (s, 1H). 13C NMR (CDCl3, 125 MHz) δ 137.183, 135.480, 130.718, 
130.303, 127.047, 124.891, 118.395, 63.156. MS calc'd for [C8H6ClNO] 167.01, found 167.9 
(LC-MS +). 

[000349] 3-bromomandelonitrile: 1H NMR (CDCl3, 500 MHz) δ 7.69 (s, 1H), 7.56 (d, J = 
6.2 Hz, 1H), 7.45 (d, J = 5.5 Hz, 1H), 7.32 (t, J = 6.4 Hz, 1H), 5.53 (s, 1H). 13C NMR (CDCl3, 
125 MHz) δ 137.376, 133.201, 130.934, 129.208, 125.359, 123.380, 118.458, 63.006. MS 
calc'd for [C8H6BrNO] 212.0, found 211.9 (LC-MS +). 

[000350] 4-fluoromandelonitrile: 1H NMR (CDCl3, 500 MHz) δ 5.54 (s, 1H), 7.13 (m, 2H), 
7.51-7.53 (m, 2H). 13C NMR (CDCl3, 125 MHz) δ 63.02, 116.44, 118.97, 128.90, 131.54, 
132.51, 162.575. 

[000351] 4-chloromandelonitrile: 1H NMR (CDCl3, 500 MHz) δ 7.47 (d, J = 7.0 Hz, 2H), 
7.42 (d, J = 7.0 Hz, 2H), 5.53 (s, 1H). 13C NMR (CDCl3, 125 MHz) δ 136.209, 133.845, 
129.647, 128.232, 118.630, 63.154. MS calc'd for [C8H6ClNO] 167.01, found 167.9 (LC-MS 
+). 

[000352] 1-naphthyl cyanohydrin: 1H NMR (CDCl3, 500 MHz) δ 8.14 (d, 1H, J = 8.5), 
7.92 (t, 2H, J = 6.1), 7.82 (d, 1H, J = 5.7), 7.62 (t, 1H, J = 6.1), 7.56 (t, 1H, J = 6.1), 7.50 (t, 
1H, J = 6.1), 6.18 (s, 1H); 13C NMR (CDCl3, 125 MHz) δ 137.0, 135.7, 134.2, 131.1, 129.2, 
127.5, 126.7, 125.8, 125.3, 123.1, 119.0, 62.4; MS calc'd for [C12H9O] 183.21, found 183.2 
(ESI +). 

[000353] 2-naphthyl cyanohydrin: 1H NMR (CDCl3, 500 MHz) δ 8.03 (s, 1H), 7.92 (d, 1H, 
J = 8.6), 7.87-7.91 (m, 2H), 7.61 (dd, 1H, J = 6.7, 1.2), 7.55-7.60 (m, 2H), 5.72 (s, 1H); 13C 
NMR (CDCl3, 125 MHz) δ 134.9, 133.9, 132.7, 129.6, 128.6, 128.0, 127.4, 127.2, 126.4, 
123.9, 118.9, 64.1; MS calc'd for [C12H9O] 183.21, found 183.2 (ESI +). 
[000354] 3-pyridyl cyanohydrin: 1H NMR (CDCl3, 500 MHz) δ 8.62 (d, 1H, J = 1.8), 8.57 
(d, 1H, J = 5.1), 7.94 (d, 1H, J = 8.1), 7.41 (dd, 1H, J = 8.1, 5.1), 5.64 (s, 1H); 13C NMR 
(CDCl3, 125 MHz) δ 149.921, 147.355, 135.412, 133.044, 124.443, 118.980, 61.085. MS 
calc'd for [C7H6N2O] 134.05, found 135.2 (ESI +). 
[000355] 3-thienyl cyanohydrin: 1H NMR (CDCl3, 500 MHz) δ 7.45 (d, J = 2.2 Hz, 1H), 
7.56 (dd, J = 6.2 Hz, 1H), 7.45 (d, J = 5.5 Hz, 1H), 7.32 (t, J = 6.4 Hz, 1H), 5.53 (s, 1H). 13C 
NMR (CDCl3, 125 MHz) δ 137.376, 133.201, 130.934, 129.208, 125.359, 123.380, 118.458, 
63.006. MS calc'd for [C6H5NOS] 139.01, found 139.9 (LC-MS +). 

[000356] phenyl acetaldehyde cyanohydrin: 1H NMR (CDCl3, 500 MHz) δ 7.34 (m, 5H), 
4.64 (t, J = 6.75 Hz, 1H), 3.11 (d, J = 6.75 Hz, 2H), 2.75 (br, 1H). 13C NMR (CDCl3, 125 
MHz) δ 133.96, 129.91, 129.16, 128.08, 119.47, 62.33, 41.55. 

[000357] 2-methylphenyl acetaldehyde cyanohydrin: 1H NMR (CDCl3, 500 MHz) δ 7.11 
(m, 4H), 4.61 (t, J = 6.62 Hz, 1H), 3.12 (d, J = 6.62 Hz, 2H), 2.14 (s, 3H). 13C NMR (CDCl3, 
125 MHz) δ 136.94, 136.47, 132.57, 130.48, 127.61, 125.75, 120.11, 62.95, 44.73. MS calc'd 
for [C10H11NO] 161.08, found 162.2 (M+Na, ESI +). 

[000358] 2-bromophenyl acetaldehyde cyanohydrin: 1H NMR (CDCl3, 500 MHz) δ 7.20 
(m, 4H), 4.78 (t, J = 6.5 Hz, 1H), 3.26 (d, J = 6.5 Hz, 2H). 13C NMR (CDCl3, 100 MHz) δ 
133.93, 132.82, 131.72, 129.21, 128.12, 124.86, 119.41, 63.02, 44.89. 

[000359] 2-fluorophenyl acetaldehyde cyanohydrin: 1H NMR (CDCl3, 500 MHz) δ 7.2 (m, 
2H), 7.02 (m, 2H), 4.50 (dd, J = 4.62 Hz, J = 7.88 Hz, 1H), 3.23 (dd, J = 4.62 Hz, J = 14.12 
Hz, 1H), 2.97 (dd, J = 7.88 Hz, J = 14.12 Hz, 1H). 13C NMR (CDCl3, 125 MHz) δ 132.18, 131.52, 
129.66, 129.03, 128.07, 124.05, 115.8, 63.02, 44.79. MS calc'd for [C9H8FNO] 165.06, found 
164.2 (ESI +). 

[000360] 3-methylphenyl acetaldehyde cyanohydrin: 1H NMR (CDCl3, 500 MHz) δ 7.18 
(m, 1H), 7.02 (m, 3H), 4.54 (dd, J = 4.62 Hz, J = 8 Hz, 1H), 3.06 (dd, J = 4.62 Hz, J = 14.38 
Hz, 1H), 2.83 (dd, J = 8 Hz, J = 14.38 Hz, 1H), 2.36 (s, 3H). 13C NMR (CDCl3, 125 MHz) 
δ 176.25, 138.18, 136.0, 130.97, 128.93, 127.68, 126.58, 76.42, 34.29, 37.69. MS calc'd for 
[C10H12O3] 180.08, found 180.0 (ESI +). 

[000361] 3-fluorophenyl acetaldehyde cyanohydrin: 1H NMR (CDCl3, 500 MHz) δ 7.18 
(m, 2H), 6.95 (m, 2H), 4.44 (dd, 1H), 3.11 (dd, 1H). 13C NMR (CDCl3, 125 MHz) δ 130.40, 
125.53, 124.85, 116.92, 114.87, 114.50, 119.77, 61.97, 41.27. 

[000362] 1-naphthyl acetaldehyde cyanohydrin: 1H NMR (CDCl3, 500 MHz) δ 8.07 (m, 1H), 
7.86 (m, 1H), 7.74 (m, 1H), 7.41 (m, 4H), 4.20 (t, J = 1 Hz, 1H), 3.33 (d, J = 6.8 Hz, 2H). 13C 
NMR (CDCl3, 125 MHz) δ 177.7, 140.31, 129.74, 129.24, 128.92, 128.26, 127.84, 125.63, 
124.53, 124.05, 123.42, 70.58, 38.0. MS calc'd for [C13H11NO] 197.08, found 197.1 (ESI +). 
[000363] 2-pyridyl acetaldehyde cyanohydrin: 1H NMR (CDCl3, 500 MHz) δ 8.50 (m, 1H), 
7.85 (m, 1H), 7.48 (m, 1H), 7.34 (m, 1H), 4.42 (m, 1H), 3.19 (dd, J = 3.5 Hz, J = 13.7 Hz, 
2H). 13C NMR (CDCl3, 125 MHz) δ 157.44, 145.69, 140.24, 126.96, 126.16, 122.99, 60.30, 
42.60. MS calc'd for [C8H8N2O] 148.06, found 149.1 (ESI +). 
[000364] 3-pyridyl acetaldehyde cyanohydrin: 1H NMR (CDCl3, 500 MHz) δ 8.62 (d, 1H, J 
= 1.8), 8.57 (d, 1H, J = 5.1), 7.94 (d, 1H, J = 8.1), 7.41 (dd, 1H, J = 8.1, 5.1), 5.64 (s, 1H). 
13C NMR (CDCl3, 125 MHz) δ 149.921, 147.355, 135.412, 133.044, 124.443, 118.980, 
61.085. Exact Mass calculated for [C7H6N2O]: 134.05, found: 135.2 (ESI +). 
[000365] 2-thienyl acetaldehyde cyanohydrin: 1H NMR (CDCl3, 500 MHz) δ 7.1 (m, 1H), 
6.9 (m, 1H), 6.8 (m, 1H), 4.11 (t, J = 7.0 Hz, 1H), 2.86 (d, J = 7.0 Hz, 2H). 13C NMR (CDCl3, 
125 MHz) δ 127.68, 127.41, 125.58, 124.60, 118.70, 63.25, 44.84. 

[000366] 3-thienyl acetaldehyde cyanohydrin: 1H NMR (CDCl3, 500 MHz) δ 7.09 (m, 
3H), 4.60 (t, J = 6.25 Hz, 1H), 3.12 (d, J = 6.25 Hz, 2H). 13C NMR (CDCl3, 125 MHz) 
δ 129.05, 127.16, 125.27, 122.65, 119.87, 61.58, 44.90. 
[000367] Preparation of racemic mandelic acid standards from corresponding 
cyanohydrins: (Stoughton, R.W. J. Am. Chem. Soc. 1941, 63, 2376) 2-bromomandelonitrile 
(230 mg, 1.08 mmol) was dissolved in conc. HCl (1 mL) and stirred at room temperature for 
18 h and then at 70 °C for 24 h. After cooling, the reaction mixture was extracted with diethyl 
ether (4 x 2 mL). Organic extracts were combined, dried over MgSO4, filtered and 
concentrated in vacuo. 2-Bromomandelic acid was isolated as a colorless powder (180 mg, 
0.78 mmol, 70 % yield). 

[000368] Preparation of racemic aryllactic acid standards from corresponding amino acids: 
Phenylalanine (10 mmol, 1.65 g) was dissolved in 30 ml 2N H2SO4 at room temperature under 
N2 (g) atmosphere. Sodium nitrite solution (1.4 g in 3 ml aqueous solution, 2 eq) was added 
slowly to the reaction mixture over a period of 3-4 hours with vigorous stirring at room 
temperature under N2 (g) atmosphere. The reaction mixture was stirred overnight and the 
phenyllactic acid product was then extracted into diethyl ether (3 x 30 ml). Combined ether 
extracts were dried over MgSO4 and then filtered and concentrated in vacuo. (Kenji, I.; 
Susumu, A.; Masaru, M.; Yasuyoshi, U.; Koki, Y.; Koichi, K. Patent Number WO 01/55074, 
Publication date: 2001-08-02.) 

General Method for Enzymatic Preparation of α-hydroxy acids:

[000369] (R)-(-)-Mandelic Acid To a solution of mandelonitrile (1.005 g, 7.56 mmol) in
150 mL of sodium phosphate (100 mM) buffer at pH 8 with 10% v/v methanol, that had been
N2 (g) sparged, at 37 °C, was added 9 mg of nitrilase 1 (normalized for nitrilase content). The
reaction was conducted under N2 (g) atmosphere on a rotating platform shaker. Reaction
progress was monitored by withdrawing aliquots for HPLC analysis. After 3 h incubation,
the reaction mixture was acidified to pH 2 with 1 N HCl and extracted with diethyl ether (4 x
50 ml). Organic fractions were concentrated in vacuo and then the residue was taken up in
10% sodium bicarbonate solution. This aqueous solution was then washed with diethyl ether
(3 x 50 ml) and then acidified to pH 2 with 1 N HCl and extracted with diethyl ether (3 x 50
ml). Organic fractions were combined, washed with brine, dried over MgSO4, filtered and
then concentrated in vacuo. (R)-(-)-Mandelic acid (933 mg, 6.22 mmol) was isolated as a
colorless powder in 86 % yield. 1H NMR (DMSO-d6, 500 MHz) δ 12.6 (br, s, 1H), 7.41 (m,
2H), 7.34 (m, 2H), 7.28 (m, 1H), 5.015 (s, 1H). 13C NMR (DMSO-d6, 125 MHz) δ
174.083, 140.216, 128.113, 127.628, 126.628, 72.359. MS calc'd for [C8H8O3] 150.07, found
150.9 (ESI +); ee = 98 % [HPLC]. [α]^20_598 = -134.6 (c = 0.5, methanol).
[000370] (-)-2-chloromandelic acid 1H NMR (DMSO-d6, 500 MHz) δ 7.75 (m, 1H), 7.44
(m, 1H), 7.34 (m, 2H), 5.34 (s, 1H). 13C NMR (DMSO, 298K, 125MHz) δ 173.070,
137.985, 132.105, 129.399, 129.158, 128.705, 127.235. MS calc'd for [C8H7ClO3] 186.0,
found 185.0 (LC-MS -). ee = 96 % [HPLC]. 92 % yield. [α]^20_598 = -137.6 (c = 0.5, ethanol).
[000371] (-)-2-bromomandelic acid 1H NMR (DMSO-d6, 500 MHz) δ 7.60 (d, J = 7.93,
1H), 7.48 (m, 1H), 7.40 (m, 1H), 7.25 (m, 1H), 5.30 (s, 1H). 13C NMR (DMSO-d6, 125
MHz) δ 172.994, 139.61, 132.355, 129.652, 128.753, 127.752, 122.681, 71.644. MS calc'd
for [C8H7BrO3] 230.0, found 230.9. ee = 96% [HPLC]. 92% yield. [α]^20_598 = -116.4 (c = 0.5,
ethanol).

[000372] (-)-2-methylmandelic acid 1H NMR (DMSO-d6, 500 MHz) δ 11.78 (bs, 1H), 7.38
(m, 1H), 7.16-7.38 (m, 3H), 5.18 (s, 1H), 2.35 (s, 3H). 13C NMR (DMSO-d6, 125 MHz) δ
174.229, 138.623, 135.649, 130.129, 127.491, 126.990, 125.698, 125.698, 69.733, 18.899.
MS calc'd for [C9H10O3] 166.1, found 165.2. ee = 91 % [HPLC]. 86 % yield. [α]^20_598 =
-164.4 (c = 0.5, ethanol).

[000373] (-)-3-chloromandelic acid 1H NMR (DMSO-d6, 500 MHz) δ 7.46 (s, 1H), 7.36 (m,
3H), 5.07 (s, 1H). 13C NMR (DMSO, 298K, 125MHz) δ 173.554, 142.685, 132.813, 130.069,
127.568, 126.355, 125.289, 71.659. MS calc'd for [C8H7ClO3] 186.0, found 185.34 (MALDI
TOF -). ee = 98 % [HPLC]. 70 % yield. [α]^20_598 = -120.4 (c = 0.5, methanol).
[000374] (-)-3-bromomandelic acid 1H NMR (DMSO-d6, 500 MHz) δ 7.60 (s, 1H), 7.49
(m, 1H), 7.42 (m, 1H), 7.31 (m, 1H), 5.06 (s, 1H). 13C NMR (DMSO, 298K, 125MHz) δ
173.551, 142.917, 130.468, 130.379, 129.237, 125.687, 121.404, 71.605. MS calc'd for
[C8H7BrO3] 229.98, found 229.1 (LC-MS). ee = 98 % [HPLC]. 82 % yield. [α]^20_598 = -84.8
(c = 0.5, ethanol).

[000375] (-)-4-fluoromandelic acid 1H NMR (DMSO, 298K, 500MHz) δ 12.65 (s, 1H),
7.44 (m, 2H), 7.17 (m, 2H), 5.91 (s, 1H), 5.03 (s, 1H). 13C NMR (DMSO, 298K, 125MHz) δ
173.93, 162.57, 136.47, 128.61, 128.55, 114.96, 114.80, 71.61. MS calc'd for [C8H7FO3]
170.0, found 168.8. ee = 99% [HPLC]. 81% yield. [α]^20_598 = -152.8 (c = 0.5, methanol).
[000376] (-)-1-naphthylglycolic acid 1H NMR (DMSO-d6, 500 MHz) δ 8.28-8.26 (m,
1H), 7.87-7.93 (m, 2H), 7.47-7.58 (m, 4H), 5.66 (s, 1H). 13C NMR (DMSO-d6, 125 MHz)
δ 174.288, 136.284, 133.423, 130.654, 128.353, 128.192, 125.926, 125.694, 125.613,
125.266, 124.558, 70.940. MS calc'd for [C12H10O3]: 202.21, found 201.37 (MALDI TOF -).
ee = 95% [HPLC]. 90 % yield. [α]^20_598 = -115.4 (c = 0.5, ethanol).
[000377] (-)-2-naphthylglycolic acid 1H NMR (DMSO-d6, 500 MHz) δ 12.6 (bm, 1H),
7.88-7.93 (m, 4H), 7.48-7.56 (m, 3H), 5.20 (s, 1H). 13C NMR (DMSO-d6, 125 MHz) δ
174.005, 137.760, 132.644, 132.498, 127.811, 127.658, 127.506, 127.209, 125.993, 125.334,
124.761, 72.472. MS calc'd for [C12H10O3] 202.21, found 201.37 (MALDI TOF). ee = 98%
[HPLC]. 68% yield. [α]^20_598 = -115.4 (c = 0.5, ethanol).

[000378] (-)-3-pyridylglycolic acid This reaction was performed in 100 mM ammonium
formate buffer at pH 8. To isolate the product, the reaction mixture was filtered through a
10,000 MWCO membrane to remove enzyme and then concentrated in vacuo. 1H NMR
(DMSO-d6, 500 MHz) δ 8.56 (s, 1H), 8.36 (d, J = 4.57 Hz, 1H), 8.25 (s, 1H), 7.71 (m, 1H),
7.25 (dd, J = 4.98, 4.80 Hz, 1H), 5.45 (s, 1H). 13C NMR (DMSO-d6, 125 MHz) δ 165.911,
147.862, 147.251, 139.118, 133.381, 122.746, 71.508. MS calc'd for [C7H7NO3] 153.04,
found 154.0 (MALDI TOF). ee = 92% [HPLC]. 84% yield. [α]^20_598 = -65.2 (c = 0.5, H2O).
[000379] (-)-3-thienylglycolic acid 1H NMR (DMSO-d6, 500 MHz) δ 7.48 (m, 1H), 7.45
(d, J = 2.81, 1H), 7.10 (m, 1H), 5.09 (s, 1H), 3.33 (s, 1H). 13C NMR (DMSO, 298K,
125MHz) δ 173.704, 141.109, 126.446, 126.042, 122.247, 68.915. MS calc'd for [C6H6O3S]
158.00, found 157.224 (MALDI TOF). ee = 95 % [HPLC]. 70 % yield. [α]^20_598 = -123.2
(c = 0.5, methanol).
[000380] (S)-(-)-phenyllactic acid 1H NMR (DMSO-d6, 500 MHz) δ 7.28 (m, 5H),
4.17 (dd, J = 4.5 Hz, J = 8.3 Hz, 1H), 2.98 (dd, J = 4.5 Hz, J = 13.7 Hz, 1H), 2.79 (dd, J = 8.3
Hz, J = 13.7 Hz, 1H). 13C NMR (DMSO, 298K, 125MHz) δ 178.16, 133.4, 129.27, 128.6,
127.3, 70.45, 44.12. ee = 97 % [HPLC]. 84 % yield. [α]^20_598 = -17.8 (c = 0.5, methanol).
[000381] (-)-2-methylphenyllactic acid 1H NMR (DMSO-d6, 500 MHz) δ 7.16 (m, 4H),
4.47 (dd, J = 3.9 Hz, J = 8.8 Hz, 1H), 3.25 (dd, J = 3.9 Hz, 14.3 Hz, 1H), 2.94 (dd, J = 8.8 Hz,
J = 14.3 Hz), 2.35 (s, 3H). 13C NMR (DMSO, 298K, 125MHz) δ 178.61, 137.08, 134.74,
130.80, 130.25, 127.44, 126.34, 70.93, 37.67, 19.79. MS calc'd for [C10H12O3] 180.08, found
180.0 (ESI +). 86 % yield, ee = 95 % [HPLC]. [α]^20_598 = -13.2 (c = 0.5, methanol).
[000382] (-)-2-bromophenyllactic acid 1H NMR (DMSO-d6, 500 MHz) δ 7.28 (m, 4H),
4.60 (dd, J = 4.0 Hz, J = 9.1 Hz, 1H), 3.45 (dd, J = 4.0 Hz, J = 14.1 Hz, 1H), 3.04 (dd, J = 8.0
Hz, J = 14.1 Hz, 1H). 13C NMR (DMSO, 298K, 125MHz) δ 178.70, 136.05, 133.21, 132.10,
128.99, 127.72, 125.0, 70.04, 40.76. MS calc'd for [C9H9BrO3] 243.9, found 243.3 (ESI +).
91 % yield, ee = 93 % [HPLC]. [α]^20_598 = -17.6 (c = 0.5, methanol).

[000383] (-)-2-fluorophenyllactic acid 1H NMR (DMSO-d6, 500 MHz) δ 7.10 (m, 4H), 4.64
(t, J = 6.8 Hz, 1H), 3.11 (d, J = 6.8 Hz, 2H). 13C NMR (DMSO, 298K, 125MHz) δ 132.18,
131.52, 129.66, 129.03, 128.07, 124.05, 115.8, 63.02, 44.79. MS calc'd for [C9H8FNO]:
165.06, found 164.2 (ESI +). 91% yield, ee = 88 % [HPLC]. [α]^20_598 = -14.0 (c = 0.5,
methanol).

[000384] (-)-3-methylphenyllactic acid 1H NMR (DMSO-d6, 500 MHz) δ 7.18 (m, 1H),
7.02 (m, 3H), 4.54 (dd, J = 4.6 Hz, J = 8.0 Hz, 1H), 3.06 (dd, J = 4.54 Hz, J = 14.4 Hz, 1H),
2.83 (dd, J = 8.0 Hz, J = 14.4 Hz, 1H), 2.36 (s, 3H). 13C NMR (DMSO, 298K, 125MHz) δ
175.88, 163.80, 130.33, 130.09, 125.7, 116.68, 113.75, 71.31, 34.28. MS calc'd for
[C10H11NO] 161.08, found 162.2 (ESI +). 80 % yield, ee = 98 % [HPLC]. [α]^20_598 = -2.4 (c =
0.5, methanol).

[000385] (-)-3-fluorophenyllactic acid 1H NMR (DMSO-d6, 500 MHz) δ 7.2 (m, 1H), 6.9
(m, 3H), 4.56 (dd, 4.5 Hz, J = 7.9 Hz, 1H), 3.09 (dd, J = 4.5 Hz, J = 14.1 Hz, 1H), 2.86 (dd, J
= 7.9 Hz, J = 14.1 Hz, 1H). 13C NMR (DMSO, 298K, 125MHz) δ 175.88, 163.80, 130.33,
130.09, 125.7, 116.68, 113.75, 71.31, 34.28. MS calc'd for [C9H9O3F] 184.05, found 184.1
(ESI +). 82 % yield, ee = 97 % [HPLC]. [α]^20_598 = -5.2 (c = 0.5, methanol).
[000386] (-)-1-naphthyllactic acid 1H NMR (DMSO-d6, 500 MHz) δ 8.57 (m, 1H), 8.21 (m,
1H), 8.08 (m, 1H), 7.61 (m, 4H), 4.64 (dd, 3.5 Hz, 8.5 Hz, 1H), 3.84 (dd, J = 3.5 Hz, J = 14.5
Hz, 1H), 3.38 (dd, J = 8.5 Hz, J = 14.5 Hz, 1H). 13C NMR (DMSO, 298K, 125MHz) δ 177.7,
140.31, 129.74, 129.24, 128.92, 128.26, 127.84, 125.63, 124.53, 124.05, 123.42, 70.58, 38.0.
MS calc'd for [C13H11NO] 197.08, found 197.1 (ESI +). 87 % yield, ee = 94 % [HPLC].
[α]^20_598 = -16.2 (c = 0.5, methanol).
[000387] (-)-2-pyridyllactic acid 1H NMR (DMSO-d6, 500 MHz) δ 8.49 (m, 1H), 7.62 (m,
1H), 7.21 (m, 2H), 4.50 (t, J = 5.0 Hz, 1H), 3.01 (d, J = 5.0 Hz, 2H). 13C NMR (DMSO,
298K, 125 MHz) δ 178.8, 159.79, 148.84, 136.89, 124.35, 121.75, 71.14, 44.09. MS calc'd
for [C8H9NO3]: 167.06, found 167.0 (ESI +). 62 % yield, ee = 94 % [HPLC]. [α]^20_598 =
-3.6 (c = 0.5, methanol).

[000388] (-)-3-pyridyllactic acid 1H NMR (DMSO-d6, 500 MHz) δ 8.43 (m, 2H), 7.62 (m,
1H), 7.28 (m, 1H), 4.57 (t, 5.37 Hz, 1H), 2.85 (d, 5.37 Hz, 2H). 13C NMR (DMSO, 298K, 125
MHz) δ 176.6, 150.03, 147.12, 136.41, 129.45, 123.26, 61.56, 31.46. MS calc'd for
[C8H9NO3] 167.06, found 167.0 (ESI +). 59 % yield, ee = 94 % [HPLC]. [α]^20_598 = -4.0 (c
= 0.5, methanol).

[000389] (-)-2-thienyllactic acid 1H NMR (DMSO-d6, 500 MHz) δ 7.18 (m, 1H), 6.94 (m,
1H), 6.90 (m, 1H), 4.49 (dd, J = 4.1 Hz, J = 6.25 Hz, 1H), 3.36 (dd, J = 4.1 Hz, J = 15.0 Hz,
1H), 3.26 (dd, J = 6.25 Hz, J = 15.0 Hz, 1H). 13C NMR (DMSO, 298K, 125MHz) δ 127.68,
127.41, 125.58, 124.60, 118.70, 63.25, 44.84. MS calc'd for [C7H7NOS] 153.02, found 153.0
(ESI +). 85 % yield, ee = 95 % [HPLC]. [α]^20_598 = -13.0 (c = 0.5, methanol).
[000390] (-)-3-thienyllactic acid 1H NMR (DMSO-d6, 500 MHz) δ 7.30 (m, 1H), 7.13 (m,
1H), 7.01 (m, 1H), 4.50 (dd, J = 4.25 Hz, J = 6.5 Hz, 1H), 3.21 (dd, J = 4.25 Hz, J = 15.0 Hz,
1H), 3.10 (dd, J = 6.5 Hz, J = 15.0 Hz, 1H). 13C NMR (DMSO, 298K, 125MHz) δ 127.50,
136.09, 128.83, 126.24, 123.32, 70.65, 34.84. MS calc'd for [C7H8O3S] 172.02, found 172.1
(ESI +). 81 % yield, ee = 96 % [HPLC]. [α]^20_598 = -18.8 (c = 0.5, methanol).
Enzymatic Hydrolysis of 3-Hydroxyglutarylnitrile:

[000391] 3-Hydroxyglutarylnitrile (1.0 g, 9.0 mmol, 240 mM) was suspended in N2 (g)
sparged sodium phosphate buffer (37.5 mL, pH 7, 100 mM) at room temperature. Cell lysate
(30 mg, normalized for nitrilase content) was added to bring the concentration to 0.8 mg/ml
enzyme and the reaction was shaken at 100 rpm at room temperature. Reaction progress was
monitored by TLC (1:1 EtOAc:Hexanes, Rf = 0.32, nitrile; Rf = 0.0, acid). After 22 h, the
reaction was acidified with 1 M HCl. The reaction mixture was continuously extracted with
diethyl ether. The acid product was isolated as a yellow oil (1.15 g, 98 % yield). 1H NMR
(DMSO, 298K, 500MHz) δ 12.32 (s, 1H), 5.52 (s, 1H), 4.10 (m, 1H), 2.70 (dd, 1H, J = 16.8,
4.1 Hz), 2.61 (dd, 1H, J = 16.9, 6.3 Hz), 2.44 (dd, 1H, J = 15.4, 5.3 Hz), 2.37 (dd, 1H, J =
15.6, 7.8 Hz). 13C NMR (DMSO, 298K, 125 MHz) δ 171.9, 118.7, 63.4, 41.2, 25.2. MS
calc'd for [C5H7NO3]: 129.0, found 130.0 [M+H+], (ESI +).
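
The quantities quoted in paragraph [000391] can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the molecular formula C5H6N2O for 3-hydroxyglutarylnitrile (the text does not state it), uses standard average atomic masses, and all function and variable names are invented for this example.

```python
# Average atomic masses (g/mol); standard values, not taken from the source text.
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999}


def molar_mass(formula):
    """Molar mass in g/mol from an element -> atom-count mapping."""
    return sum(ATOMIC_MASS[element] * count for element, count in formula.items())


# ASSUMPTION: formula of the dinitrile substrate (not given in the text).
nitrile = {"C": 5, "H": 6, "N": 2, "O": 1}
# Product formula C5H7NO3 as quoted with the MS data.
acid = {"C": 5, "H": 7, "N": 1, "O": 3}

buffer_ml = 37.5
substrate_mmol = 1.0 / molar_mass(nitrile) * 1000   # 1.0 g of substrate charged
substrate_mM = substrate_mmol / buffer_ml * 1000    # concentration in the buffer
enzyme_mg_per_ml = 30 / buffer_ml                   # 30 mg lysate in 37.5 mL
product_mmol = 1.15 / molar_mass(acid) * 1000       # 1.15 g of acid isolated
yield_pct = 100 * product_mmol / substrate_mmol

print(f"substrate: {substrate_mmol:.2f} mmol, about {substrate_mM:.0f} mM")
print(f"enzyme: {enzyme_mg_per_ml:.1f} mg/ml")
print(f"isolated yield: {yield_pct:.0f} %")
```

Under these assumptions the computed enzyme loading (0.8 mg/ml) and yield (about 98 %) agree with the figures in the text; the substrate concentration comes out slightly above the rounded 240 mM because the text rounds the amount to 9.0 mmol.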

Preparation of (R)-(-)-Methyl (3-O-[benzoyl]-4-cyano)-butanoate

[000392] Benzoyl chloride (0.068 ml, 0.752 mmol) was added to a stirred solution of (R)-
methyl-(3-hydroxy-4-cyano)-butanoate (71.7 mg, 0.501 mmol) in pyridine (2.0 ml) at room
temperature. After 19 hours, an additional 0.5 equivalent of benzoyl chloride (0.023 ml,
0.251 mmol) was added. The reaction was complete at 23 h, as determined by TLC. Water
(1 ml) was added and the mixture was extracted with ether (3 x 10 ml). The combined ether
extracts were washed with brine (2 x 10 ml) and dried over MgSO4. The drying agent was
filtered off and the solvent removed by rotary evaporation. The residue was purified by column
chromatography (hexane:ethyl acetate [2:1]). Rotary evaporation of fractions yielded the
product as a yellow oil (46 mg, 0.186 mmol, 37%). 1H NMR (DMSO, 298K, 500MHz) δ
7.96 (d, 2H, J = 7.8), 7.70 (t, 1H, J = 7.25), 7.56 (t, 2H, J = 7.8), 5.55 (m, 1H), 3.59 (s, 3H),
3.13 (m, 2H), 2.90 (m, 2H). 13C NMR (DMSO, 298K, 125MHz) δ 169.6, 164.5, 133.8, 129.3,
128.9, 128.5, 117.3, 66.0, 51.8, 37.5, 22.2. MS calc'd for [C13H13NO4]: 247.25, found 270.3
[M+Na+]. ee = 95% [HPLC]. [α]^20_598 = -32.4 (c = 0.5, CHCl3).
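
As a rough consistency check on paragraph [000392], the isolated yield and the expected sodium adduct mass can be recomputed from the stated numbers. This is only a sketch: the sodium mass is a standard value that does not appear in the text, the electron mass is neglected, and the variable names are hypothetical.

```python
NA_MASS = 22.990  # average atomic mass of sodium (g/mol); standard value, not from the text

product_mw = 247.25      # C13H13NO4, as stated in the text
product_mg = 46.0        # isolated benzoate ester
substrate_mmol = 0.501   # (R)-methyl-(3-hydroxy-4-cyano)-butanoate charged

product_mmol = product_mg / product_mw
yield_pct = 100 * product_mmol / substrate_mmol
sodium_adduct = product_mw + NA_MASS   # approximate [M+Na]+ value

print(f"product: {product_mmol:.3f} mmol, yield {yield_pct:.0f} %")
print(f"expected [M+Na]+ near {sodium_adduct:.1f}")
```

The recomputed values (0.186 mmol, about 37 %, and an adduct mass close to the reported 270.3) match the figures quoted above.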

Synthesis of (R)-Ethyl-(3-hydroxy-4-cyano)-butanoate

[000393] A 0.2 M solution of (R)-3-hydroxy-4-cyano-butanoic acid (50 mg, 0.387 mmol) in
anhydrous ethanol (1.94 mL) was prepared. The ethanol solution was added dropwise to 1.0
ml of a 50:50 (v/v) mixture of anhydrous 1 M HCl ethereal solution and anhydrous ethanol
over sieves. The reaction was stirred overnight at room temperature under N2 (g) atmosphere.
The reaction was monitored by TLC (1:1 EtOAc:Hexanes, Rf = 0.45, ester; Rf = 0.0, acid,
stained with p-anisaldehyde). After 30 hrs, solvent was removed by rotary evaporation. The
crude product was taken up in 25 mL ether, washed with 5 mL saturated bicarbonate and then
5 mL brine. The organic extract was dried over MgSO4, filtered and then concentrated in
vacuo, yielding the product as a clear oil. 1H NMR (DMSO, 298K, 500MHz) δ 5.60 (d, 1H,
J = 5.58 Hz), 4.12 (m, 1H), 4.07 (q, 2H, J = 7.1), 2.66 (m, 2H), 2.47 (m, 2H), 1.87 (t, 3H, J =
7.0). 13C NMR (DMSO, 298K, 125 MHz) δ 170.21, 118.60, 63.40, 59.98, 41.10, 25.14,
14.02. MS calc'd for [C7H11NO3]: 157.1, found 158.2 [M+H+].
Example 13: Optimization Of Nitrilases For The Enantioselective Production Of
(R)-2-Chloromandelic Acid



[000394] Chloromandelic acid has the structure:

[Structure drawing of 2-chloromandelic acid; only the labels "HO", "H" and "COOH" survive in this text.]



[000395] Nitrilases were identified which selectively produced (R)-2-chloromandelic
acid from (R,S)-2-chloromandelonitrile. Nitrilases were identified which were useful for
improving the enantioselectivity of the enzymes and for establishing the effects of process
conditions on the enzymes. An examination of the reaction conditions for the enzymatic
nitrile hydrolysis was carried out in order to improve the enantiomeric excess of the product.
Additionally, further investigation into the effects of process conditions on the enzyme was
performed.



[Reaction scheme: 2-chloromandelonitrile to (R)-2-chloromandelic acid]



[000396] In this aspect, the enantioselective production of (R)-2-chloromandelic acid
was the target. One enzyme, SEQ ID NOS:385, 386, was selected for further confirmation of
its enantioselectivity on 2-chloromandelonitrile. SEQ ID NOS:385, 386 was shown to be
stable to process components, with a half-life of 8 hours. The enzyme was inhibited by
2-chlorobenzaldehyde and a contaminant in the cyanohydrin substrate, 2-chlorobenzoic acid.
The enzymatic reaction was scaled up to a substrate concentration of 45 mM
2-chloromandelonitrile. Over 90% conversion was obtained, with ee of 97%. The chiral HPLC
method was improved, to remove a contaminating peak that was present in the substrate.
Improved accuracy in the determination of enantioselectivity was obtained using this method.
[000397] Nitrilases were screened against 2-chloromandelonitrile, with 31 nitrilases
exhibiting activity on this substrate. High enantioselectivities were shown by 9 enzymes.
The optimization of 5 of these enzymes was undertaken and one of them was identified as a
candidate for the next stage of development.
[000398] In an effort to improve the enantioselectivity of the selected enzymes for
(R)-2-chloromandelic acid, a number of factors that are known to affect this property, together
with the activity of the enzymes, were investigated. These included pH, temperature, buffer
strength and addition of solvents to the reaction. Initially, 5 nitrilases were selected for these
studies, based on the high enantioselectivities obtained by these enzymes. These enzymes
were: SEQ ID NOS:385, 386, SEQ ID NOS:197, 198, SEQ ID NOS:217, 218, SEQ ID
NOS:55, 56, and SEQ ID NOS:167, 168.
Effect of pH

[000399] The enzymatic reactions were run at a range of pH values, from pH 5 to pH 9.
An increase in both activity and enantioselectivity with increasing pH was observed for all of
the enzymes. With the exception of SEQ ID NOS:385, 386, pH 9 (0.1 M Tris-HCl buffer)
was determined as the optimum for activity and enantioselectivity. The optimum pH for SEQ
ID NOS:385, 386 was pH 8 (0.1 M sodium phosphate buffer).
Effect of temperature 

[000400] The enzymes exhibited similar temperature profiles, with the highest activities
being measured at 37°C and 45°C. Although the latter temperature resulted in higher
conversions, the enantioselectivity of most of the enzymes showed a clear preference for the
lower temperatures, with ee values being 10-20% lower when the temperature was raised
above 37°C. In the case of SEQ ID NOS:385, 386 a slight optimum for enantioselectivity
was evident at 37°C. Therefore, this temperature was established as the optimum for
hydrolysis of 2-chloromandelonitrile by these enzymes.
Effect of enzyme concentration 

[000401] During the concurrent investigation into the enantioselective hydrolysis of
phenylacetaldehyde cyanohydrin to L-phenyllactic acid, the concentration of the enzyme in
the reaction was found to have a significant effect on the enantioselectivity of the reaction.
This provided an indication that the enzymatic hydrolysis rate was faster than the rate of
racemization of the remaining cyanohydrin in the reaction. On this basis, the effect of
enzyme concentration on the enantioselectivity of the enzymes towards
(R)-2-chloromandelonitrile was investigated. Enzymatic reactions were performed with the
standard concentration of enzyme (0.6 mg protein/ml), half the standard concentration and
one-tenth of the standard concentration.

[000402] The following Table indicates the highest conversions achieved for the 
reactions, with the corresponding ee. With the exception of SEQ ID NOS:385, 386, it 
appears that very little, if any, increased enantioselectivity is observed. Therefore, it appears 
that the rate of racemization of the remaining chloromandelonitrile is not a limiting factor to 
obtaining higher enantioselectivities. 
Investigation of other positive enzymes 

[000403] In addition to the enzymes in the above Table, a number of other nitrilases
were screened for their enantioselectivities on 2-chloromandelonitrile. Some of these
enzymes were newly discovered enzymes. Some were reinvestigated under conditions that
have since been found to be optimal for these enzymes (pH 8 and 37°C). The results of this
screening are shown below in the Table.

Effect of co-solvent concentration 

[000404] The addition of methanol as a cosolvent in the enzymatic reactions was shown 
to enhance the ee. In order to establish the lowest level of methanol that could be added to 
the reactions, the enzyme reactions were performed at varying concentrations of methanol, 
ranging from 0-20% (v/v). No significant differences in enantioselectivity were evident
between the various methanol concentrations. However, the ee in these reactions was
97-98%, while that of the control reaction, with no added methanol, was 95-96%. While this
difference in ee is small, the effect of the methanol was shown in more than one set of 
experiments during the course of this investigation and is therefore regarded as significant. 

Effect of reaction components on activity of SEQ ID NOS:385, 386
[000405] A vital part of an investigation into process optimization of an enzyme 
involves the determination of the effects of any compounds which could be present in the 
enzymatic reaction. For SEQ ID NOS:385, 386, these components were established as the 
starting material and equilibrium product of the cyanohydrin, 2-chlorobenzaldehyde; the 
product, 2-chloromandelic acid and the contaminant detected in the substrate, 2- 
chlorobenzoic acid. The addition of cyanide to the reaction was found to have no effect on 
the enzyme activity. The presence of trace amounts of triethylamine was also found to be 
tolerable to the enzyme. 
[000406] The effect of the various reaction components on the activity of SEQ ID
NOS:385, 386 was assessed by addition of various levels of possible inhibitors to the enzyme
reaction. From these experiments, it appeared that both the aldehyde and its oxidation
product, 2-chlorobenzoic acid, were detrimental to enzyme activity. Approximately 70% and
40% of the activity of SEQ ID NOS:385, 386 was lost upon addition of 5 mM
2-chlorobenzaldehyde or 5 mM 2-chlorobenzoic acid to the reaction, respectively.
Scale-up hydrolysis of 2-chloromandelonitrile

[000407] In order to confirm the conversion and enantioselectivity obtained by SEQ ID 
NOS:385, 386 for the production of (R)-2-chloromandelic acid, a larger scale reaction was 
performed and the product isolated from the aqueous mixture. The reaction was performed in 
a 20 ml reaction volume, with a substrate concentration of 45 mM 2-chloromandelonitrile. 
Complete conversion of the cyanohydrin was obtained, with 30 mM product formed. The ee 
of the product was 97% and the specific activity of the enzyme was 0.13 mmol product/mg 
nitrilase/h. 

[000408] It is evident from this experiment, together with the other experiments 
performed, that the formation of product does not account for the complete loss of substrate. 
In all experiments, a nitrile-containing control sample was run, in order to determine the 
extent of breakdown of the cyanohydrin. Overall, it appears that approximately 50% of the 
substrate is lost over a period of 4 hours at 37°C. It is expected that this breakdown would be 
to its equilibrium products, cyanide and 2-chlorobenzaldehyde, which could undergo further 
oxidation. A larger scale reaction was also run at a substrate concentration of 90 mM 2- 
chloromandelonitrile. However, no product was detected in this reaction. At higher substrate 
concentrations, it is expected that the concentration of the equilibrium product, 2- 
chlorobenzaldehyde and the contaminant, 2-chlorobenzoic acid will be present in higher 
amounts. Based on the results above, it is possible that the enzyme will be completely 
inhibited under such conditions. 
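
The mass-balance gap described in paragraphs [000407] and [000408] can be made explicit with a short calculation. The sketch below uses only the stated figures (45 mM substrate, 30 mM product, roughly 50% substrate loss in the enzyme-free control over 4 hours). The first-order decay model is an assumption introduced here for illustration; the text reports only the single 4-hour observation, and the function names are hypothetical.

```python
def unaccounted_mM(substrate_mM, product_mM, residual_mM=0.0):
    """Substrate that appears neither as product nor as unreacted nitrile."""
    return substrate_mM - product_mM - residual_mM


def fraction_remaining(t_h, half_life_h=4.0):
    """Fraction of cyanohydrin left in the enzyme-free control after t_h hours.

    ASSUMPTION: simple first-order breakdown, calibrated so that half of the
    substrate is lost in 4 h at 37 C. The source does not give a kinetic model.
    """
    return 0.5 ** (t_h / half_life_h)


# Scale-up run: complete conversion of 45 mM substrate, 30 mM product formed.
gap = unaccounted_mM(45.0, 30.0)
gap_pct = 100 * gap / 45.0
print(f"unaccounted substrate: {gap:.0f} mM ({gap_pct:.0f} % of the charge)")

# Illustrative control-sample breakdown under the assumed model.
for t in (1, 2, 4):
    print(f"t = {t} h: {100 * fraction_remaining(t):.0f} % of nitrile remaining")
```

On these numbers about one third of the substrate charge is not recovered as product, which is consistent with the breakdown to 2-chlorobenzaldehyde and cyanide proposed in the text.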
Reactions under biphasic conditions 

[000409] The use of biphasic systems can facilitate product recovery following the
enzymatic reaction step. These systems can also be used for the removal of products or
by-products which are inhibitory to the enzyme. The nitrilases were shown to be active under
biphasic conditions using a variety of solvents. Following the low conversions obtained at
the higher substrate concentration above, further investigation of a biphasic system was
performed with the hit enzyme, SEQ ID NOS:385, 386. It was important to ascertain
whether any inhibitory factors could be removed by the solvent phase and whether any
process advantages could be gained by the use of a biphasic system.

[000410] Promising results were obtained with hexane as the organic phase. Therefore, 
further investigations involved the use of this solvent at two different levels: 100% and 70% 
of the volume of the aqueous phase, with increasing substrate concentrations, up to 90 mM. 
The substrate was dissolved in the organic phase. The level of hexane did not appear to 
affect the level of product formation, particularly at the higher concentrations of 2- 
chloromandelonitrile. 

[000411] Once again, high conversion was observed in a biphasic system, with a 76% 
yield of product being observed after 5 hours. The rate of product formation appeared to be 
slightly lower than in the corresponding monophasic system, where the reaction is complete 
within 1 hour. Lower enantioselectivity was observed in the biphasic system. Some 
possibilities which may account for these results are (i) the mass transfer rate is lower than 
the rate of enzyme activity or (ii) the non-polar solvent directly affects the enzyme. 
[000412] At a higher substrate concentration, a very low conversion was observed, with 
7 mM 2-chloromandelic acid being formed from 90 mM 2-chloromandelonitrile. This level 
of conversion, albeit low, was higher than that observed in the monophasic system with the 
same substrate concentration. These results suggest that some of the inhibitory 2- 
chlorobenzaldehyde or 2-chlorobenzoic acid is retained in the non-polar organic solvent. 
Standard assay conditions: 

[000413] The following solutions were prepared: 

- Substrate stock solution: 50 mM of the cyanohydrin substrate in 0.1 M phosphate buffer
(pH 8).

- Enzyme stock solution: 3.33 ml of 0.1 M phosphate buffer (pH 8) added to each vial of 20 mg of
lyophilized cell lysate (final concentration 6 mg protein/ml).

[000414] The reaction volumes varied between the different experiments, depending on the 
number of time points taken. Unless otherwise noted, all reactions consisted of 25 mM 2- 
chloromandelonitrile and 10% (v/v) of the enzyme stock solution (final concentration 0.6 mg 
protein/ml). The reactions were run at 37°C, unless otherwise stated. Controls to monitor the 
nitrile degradation were run with every experiment. These consisted of 25 mM 2- 
chloromandelonitrile in 0.1 M phosphate buffer (pH 8). 
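
The stock and working concentrations in paragraphs [000413] and [000414] follow from simple dilution arithmetic, sketched below. The helper names are invented for this example, and the 5% and 1% rows simply correspond to the "half" and "one-tenth" enzyme levels mentioned earlier in this Example.

```python
def stock_conc_mg_per_ml(lysate_mg, buffer_ml):
    """Protein concentration of the enzyme stock (mg/ml)."""
    return lysate_mg / buffer_ml


def final_conc(stock_conc, percent_v_v):
    """Concentration after adding the stock at the given % (v/v) of reaction volume."""
    return stock_conc * percent_v_v / 100


enzyme_stock = stock_conc_mg_per_ml(20, 3.33)   # 20 mg lysate in 3.33 ml buffer
print(f"enzyme stock: {enzyme_stock:.1f} mg protein/ml")

for pct in (10, 5, 1):                          # standard, half, one-tenth
    print(f"{pct:>2} % (v/v): {final_conc(enzyme_stock, pct):.2f} mg protein/ml")

# Share of the 50 mM substrate stock needed to reach 25 mM in the reaction.
substrate_fraction = 25 / 50
print(f"substrate stock: {100 * substrate_fraction:.0f} % of the reaction volume")
```

The calculation reproduces the stated 6 mg/ml stock and 0.6 mg/ml standard reaction concentration. The 50% substrate-stock share is an inference from the two stated concentrations rather than a figure given in the text.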

[000415] Sampling of reactions: The reactions were sampled by removing an aliquot from
each reaction and diluting these samples by a factor of 8. Duplicate samples were taken for
analysis by chiral and achiral HPLC methods. The reactions were sampled at 0.5, 1, 1.5, 2, 3,
and 4 hours, unless otherwise shown in the figures above.
HPLC methods 

The achiral HPLC method was run on a SYNERGI-RP™ column (4 µm; 50 x 2 mm)
with a mobile phase of 10 mM Na phosphate buffer (pH 2.5). A gradient of methanol was
introduced at 3.5 min and increased to 50% over 1.5 min, following which the methanol was
decreased to 0%. Elution times for 2-chloromandelic acid and 2-chloromandelonitrile were
2.5 and 6.1 minutes, with another peak appearing with the nitrile at 5.9 minutes.
[000416] As described above, the chiral HPLC method was optimized during the course
of the investigation, to improve the separation between 2-chlorobenzoic acid and
(S)-2-chloromandelic acid. The optimized method was used during the latter half of the
investigation and was run on a CHIROBIOTIC-R™ column. The mobile phase was 80%
Acetonitrile:20% of 0.5% (v/v) acetic acid. Elution times for (S)-2-chloromandelic acid and
(R)-2-chloromandelic acid were 2.4 and 3.5 minutes respectively. A peak for
2-chlorobenzoic acid eluted at 1.9 minutes. For each experiment, a standard curve of the
product was included in the HPLC run. The concentration of product in the samples was
calculated from the slope of these curves.
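
The quantitation described above (a standard curve for concentration, chiral peak areas for ee) can be sketched as follows. All peak areas and standard concentrations in the example are made-up illustration values, not data from this Example. Forcing the calibration line through the origin is an assumption, since the text says only that the slope of the standard curve was used. The 8-fold dilution factor is taken from paragraph [000415].

```python
def slope_through_origin(concs_mM, areas):
    """Least-squares slope of peak area vs concentration for a line through zero.

    ASSUMPTION: zero intercept; the source does not describe the fit.
    """
    return sum(c * a for c, a in zip(concs_mM, areas)) / sum(c * c for c in concs_mM)


def reaction_conc_mM(area, slope, dilution_factor=8):
    """Product concentration in the reaction, corrected for sample dilution."""
    return area / slope * dilution_factor


def ee_percent(area_major, area_minor):
    """Enantiomeric excess (%) from the two enantiomer peak areas."""
    return 100 * (area_major - area_minor) / (area_major + area_minor)


# Hypothetical standard curve: three standards with a perfectly linear response.
slope = slope_through_origin([5, 10, 20], [50, 100, 200])

print(reaction_conc_mM(37.5, slope))   # hypothetical product peak area
print(ee_percent(98.5, 1.5))           # hypothetical (R) and (S) peak areas
```

With these invented inputs the sketch gives a 30 mM product concentration and an ee of 97%; real runs would substitute the measured areas for the (R) peak at 3.5 minutes and the (S) peak at 2.4 minutes.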
Effect of pH 

[000417] The effect of pH on the enzyme activity and enantioselectivity was studied by
performance of the standard assay in a range of different buffers: 0.1 M Citrate Phosphate pH
5; 0.1 M Citrate Phosphate pH 6; 0.1 M Sodium Phosphate pH 6; 0.1 M Sodium Phosphate
pH 7; 0.1 M Sodium Phosphate pH 8; 0.1 M Tris-HCl pH 8; and 0.1 M Tris-HCl pH 9. The
standard enzyme concentration was used for all enzymes, with the exception of SEQ ID
NOS:385, 386, where half the standard concentration was used (5% v/v of the enzyme stock
solution).

Effect of temperature 

[000418] The effect of temperature on the activity and enantioselectivity was
investigated by performing the standard assay at a range of different temperatures: room
temperature, 37°C, 45°C, 50°C and 60°C. The standard enzyme concentration was used for
all enzymes, with the exception of SEQ ID NOS:385, 386, where half the standard
concentration was used (5% v/v of the enzyme stock solution).
Effect of enzyme concentration

[000419] Reactions were run under standard conditions, with varying enzyme 
concentrations: 1%, 5% and 10% (v/v) of the enzyme stock solution. The reaction volume 
was normalized with the appropriate buffer. 
Addition of solvents

[000420] The enzyme reactions were performed in the presence of methanol as a
cosolvent. Methanol was added to the standard reaction mixture at the following levels: 0, 5,
10, 15 and 20% (v/v).

[000421] Biphasic reactions with hexane were also investigated. The aqueous phase
contained 10% (v/v) of the enzyme stock solution in 0.1 M phosphate buffer (pH 8). The
cyanohydrin was dissolved in the hexane, prior to addition to the reaction. Two levels of
organic phase were used: 1 equivalent and 0.7 equivalents of the aqueous phase volume. In
addition, a range of nitrile concentrations was investigated: 25, 45 and 90 mM. These
reactions were run at room temperature.

[000422] Samples from these reactions were taken both from the aqueous and the
solvent phase. The hexane was evaporated by centrifugation under vacuum and the residue
was redissolved in a 50:50 mixture of methanol and water, so that the samples were at the same
dilution as the aqueous samples. Analysis of the samples was performed by non-chiral and
chiral HPLC.
Effect of process components 

[000423] (i) Activity: The effect of the process components on the activity of the
enzymes was established by addition of the individual components, 2-chlorobenzaldehyde,
2-chlorobenzoic acid or 2-chloromandelic acid, to the enzymatic reaction. The enzymatic
reactions were carried out under standard conditions, in the presence of one of the possible
inhibitors as follows: 5, 10, 20 and 25 mM 2-chlorobenzaldehyde; 1.5 and 5 mM
2-chlorobenzoic acid; and 10, 20, 40 and 80 mM 2-chloromandelic acid. Control reactions
were performed under standard conditions, with no additive. At each of the sampling times,
the samples were diluted to a level of 1 in 10. Control samples containing the reaction
components without enzyme were used and diluted to the same level. The samples were
analysed by non-chiral HPLC.

[000424] (ii) Stability: The stability of the enzymes to process conditions was monitored
by incubation of the enzymes in the presence of the reaction components,
2-chlorobenzaldehyde and 2-chloromandelic acid, for predetermined time periods, prior to assay
of the enzyme activity under standard conditions. In these experiments, the enzymes were
incubated at a concentration of 3 mg protein/ml in the presence of each of the following
reaction components: 5, 10, 20 and 25 mM 2-chlorobenzaldehyde; and 10, 20, 40 and 80 mM
2-chloromandelic acid. Control reactions were performed by incubation of the enzyme in
buffer only.

[000425] Assay conditions: At 0, 4, 8 and 24 hours of incubation in the particular
additive, 20 µl of the enzyme solution was removed and added to 60 µl of a 41.6 mM
substrate stock solution and 20 µl buffer. The enzyme activity was thus assayed under
standard conditions. The reactions were sampled 90 minutes after substrate addition and
analyzed using the non-chiral HPLC method.
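
The statement that this mixture reproduces the standard assay conditions can be verified from the stated volumes and concentrations. The sketch below uses only numbers from paragraphs [000424] and [000425]; the variable names are illustrative.

```python
# Volumes combined for each stability assay (microlitres).
enzyme_ul, substrate_ul, buffer_ul = 20.0, 60.0, 20.0
total_ul = enzyme_ul + substrate_ul + buffer_ul

enzyme_incubation_mg_per_ml = 3.0   # enzyme concentration during pre-incubation
substrate_stock_mM = 41.6           # substrate stock used for this assay

final_enzyme_mg_per_ml = enzyme_incubation_mg_per_ml * enzyme_ul / total_ul
final_substrate_mM = substrate_stock_mM * substrate_ul / total_ul
carryover_dilution = total_ul / enzyme_ul   # dilution of any pre-incubation additive

print(f"enzyme: {final_enzyme_mg_per_ml:.1f} mg protein/ml")
print(f"substrate: {final_substrate_mM:.1f} mM")
print(f"additive carried over is diluted {carryover_dilution:.0f}-fold")
```

The result, 0.6 mg protein/ml and about 25 mM substrate, matches the standard conditions defined in paragraph [000414]. The five-fold dilution also means that an additive present at 25 mM during incubation is still present at about 5 mM in the activity assay.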
Scale-up of enzymatic reaction 

[000426] The enzymatic reactions were run at two different concentrations: 45 mM
and 90 mM substrate. The reactions were run under standard conditions, i.e. pH 8 (0.1 M
sodium phosphate buffer), 37°C and 10% (v/v) of the enzyme stock solution. The substrate
was dissolved in 10% (v/v) methanol prior to addition of the buffer. The final reaction
volume was 20 ml and the reactions were performed with magnetic stirring.

Example 14: Optimization Of Nitrilases For The Enantioselective Production Of 
L-2-amino-6,6-dimethoxyhexanoic acid



[Reaction scheme: 5,5-dimethoxypentanal → 5,5-dimethoxypentanal aminonitrile → L-2-amino-6,6-dimethoxyhexanoic acid]

[000427] Four of the isolated enzymes were shown to hydrolyze 2-amino-6-hydroxy 
hexanenitrile to (S)-2-amino-6-hydroxy hexanoic acid, with selectivity towards the L- 
enantiomer. A new target, with a similar structure to (S)-2-amino-6-hydroxy hexanoic acid 
was identified. A panel of the isolated nitrilases is screened against the target,
5,5-dimethoxypentanal aminonitrile. The positive enzymes are characterized on this substrate.
Laboratory evolution techniques can be used to optimize these nitrilases for improved
enantiospecificity towards the specified target. A primary screen is used to identify putative
up-mutants, which are then confirmed using HPLC.

[000428] Optimization of enzymes: GSSM™ and GeneReassembly™ can be performed 
on selected nitrilases, in order to improve the enantioselectivity and activity of the enzymes 


for the production of L-2-amino-6,6-dimethoxyhexanoic acid. Four enzymes were identified 
that can hydrolyze enantioselectively 2-amino-6-hydroxy hexanenitrile to L-(S)-2-amino-6- 
hydroxy hexanoic acid. However, a slight structural difference is present in the new target 
molecule, L-2-amino-6,6-dimethoxyhexanoic acid. In order to determine whether this 
difference affects the activity and enantioselectivity of the enzymes, the complete spectrum
of nitrilases is screened against the new target. 

[000429] An enzyme exhibiting the highest combination of activity and 
enantioselectivity for the production of L-2-amino-6,6-dimethoxyhexanoic acid is selected for
GSSM™. Following the mutation of the target enzyme, the resulting mutants will be
screened on 5,5-dimethoxypentanal aminonitrile, using high throughput screening
technology. Following confirmation of the up-mutants by HPLC analysis, the individual
up-mutants will be combined in order to further enhance the properties of the mutant enzymes.
[000430] In parallel to GSSM™, a GeneReassembly™ can be performed on a 
combination of parent enzymes, at least one of which can be selected for activity and
enantioselectivity on L-2-amino-6,6-dimethoxyhexanoic acid. At least two other nitrilases,
with a high degree of homology, can be reassembled with the former enzyme(s); these 
enzymes will be selected in order to provide diversity to the reassembled sequences. 
[000431] Crucial to the success of this evolution effort is the development of a high 
throughput assay for enantioselectivity. Such an assay is a novel enzyme-based
enantioselectivity assay that allows for the screening of >30,000 mutants in a significantly
shorter time period than the traditionally used method of HPLC. 

[000432] In one aspect, a non-stochastic method termed synthetic ligation reassembly
can be used to create variants. The method is related to stochastic shuffling, except that the
nucleic acid building blocks are not shuffled, concatenated or chimerized randomly, but
rather are assembled non-stochastically. This method does not require the presence of
high homology between nucleic acids to be shuffled. The ligation reassembly method can be
used to non-stochastically generate libraries (or sets) of progeny molecules having at least
10^100 or at least 10^1000 different chimeras. The ligation reassembly method provides a
non-stochastic method of producing a set of finalized chimeric nucleic acids that have an overall
assembly order that is chosen by design, which method is comprised of the steps of
generating by design a plurality of specific nucleic acid building blocks having serviceable
mutually compatible ligatable ends, and assembling these nucleic acid building blocks, such that
a designed overall assembly order is achieved.
[000433] The mutually compatible ligatable ends of the nucleic acid building blocks to
be assembled are considered to be "serviceable" for this type of ordered assembly if they
enable the building blocks to be coupled in predetermined orders. Thus, in one aspect, the
overall assembly order in which the nucleic acid building blocks can be coupled is specified
by the design of the ligatable ends and, if more than one assembly step is to be used, then the
overall assembly order in which the nucleic acid building blocks can be coupled is also
specified by the sequential order of the assembly step(s). In one aspect of the invention, the
annealed building pieces are treated with an enzyme, such as a ligase (e.g., T4 DNA ligase) to
achieve covalent bonding of the building pieces.

[000434] In another aspect, the design of nucleic acid building blocks is obtained upon
analysis of the sequences of a set of progenitor nucleic acid templates that serve as a basis for
producing a progeny set of finalized chimeric nucleic acid molecules. These progenitor
nucleic acid templates thus serve as a source of sequence information that aids in the design
of the nucleic acid building blocks that are to be mutagenized, i.e. chimerized, recombined or
shuffled.

[000435] In one exemplification, the invention provides for the chimerization of a 
family of related genes and their encoded family of related products. In a particular 
exemplification, the encoded products are nitrilase enzymes. Nucleic acids encoding the 
nitrilases of the invention can be mutagenized in accordance with the methods described
herein.

[000436] Thus, according to one aspect of the invention, the sequences of a plurality of 
progenitor nucleic acid templates encoding nitrilases are aligned in order to select one or 
more demarcation points, which demarcation points can be located at an area of homology. 
The demarcation points can be used to delineate the boundaries of nucleic acid building
blocks to be generated. Thus, the demarcation points identified and selected in the progenitor
molecules serve as potential chimerization points in the assembly of the progeny molecules. 
[000437] Typically a serviceable demarcation point is an area of homology (comprised 
of at least one homologous nucleotide base) shared by at least two progenitor templates, but 
the demarcation point can be an area of homology that is shared by at least half of the
progenitor templates, at least two thirds of the progenitor templates, at least three fourths of
the progenitor templates, and preferably almost all of the progenitor templates. Even more
preferably still, a serviceable demarcation point is an area of homology that is shared by all of
the progenitor templates. 
[000438] In one aspect, the ligation reassembly process is performed exhaustively in
order to generate an exhaustive library. In other words, all possible ordered combinations of
the nucleic acid building blocks are represented in the set of finalized chimeric nucleic acid
molecules. At the same time, the assembly order (i.e., the order of assembly of each building
block in the 5' to 3' sequence of each finalized chimeric nucleic acid) in each combination is
by design (or non-stochastic, non-random). Because of the non-stochastic nature of the
method, the possibility of unwanted side products is greatly reduced.
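
The idea of an exhaustive library with a fixed 5' to 3' order can be illustrated with a short enumeration. This is only a sketch under stated assumptions: three hypothetical progenitors (A, B, C), each cut at the same two demarcation points into three building blocks, with block labels such as `A1` invented for illustration.

```python
from itertools import product

def exhaustive_library(blocks_per_position):
    """Enumerate every chimera that takes exactly one building block per
    position, keeping the positions in their designed 5'-to-3' order."""
    return ["-".join(combo) for combo in product(*blocks_per_position)]

# Hypothetical building blocks: position 1, 2 and 3 from progenitors A, B, C
blocks_per_position = [
    ["A1", "B1", "C1"],
    ["A2", "B2", "C2"],
    ["A3", "B3", "C3"],
]

library = exhaustive_library(blocks_per_position)
print(len(library))   # 3 progenitors ** 3 positions
print(library[:4])
```

The library size grows as (number of progenitors) raised to the number of positions, which is why even modest designs reach very large numbers of chimeras.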

[000439] In another aspect, the ligation reassembly process is
performed systematically, for example in order to generate a systematically
compartmentalized library, with compartments that can be screened systematically, e.g., one
by one. Each compartment (or portion) holds chimeras or recombinants with known
characteristics. In other words, the invention provides that, through the selective and
judicious use of specific nucleic acid building blocks, coupled with the selective and
judicious use of sequentially stepped assembly reactions, an experimental design can be
achieved where specific sets of progeny products are made in each of several reaction
vessels. This allows a systematic examination and screening procedure to be performed.
Thus, it allows a potentially very large number of progeny molecules to be examined
systematically in smaller groups.

[000440] Because of its ability to perform chimerizations in a manner that is highly
flexible, yet exhaustive and systematic, particularly when there is a low level of homology
among the progenitor molecules, the invention described herein provides for the generation
of a library (or set) comprised of a large number of progeny molecules. Because of the
non-stochastic nature of the ligation reassembly method, the progeny molecules generated
preferably comprise a library of finalized chimeric nucleic acid molecules having an overall
assembly order that is chosen by design. In a particular aspect, such a generated library is
comprised of greater than 10^3 to greater than 10^1000 different progeny molecular species.
[000441] In another exemplification, the synthetic nature of the step in which the 
building blocks are generated allows the design and introduction of nucleotides (e.g., one or 
more nucleotides, which may be, for example, codons or introns or regulatory sequences) that 
can later be optionally removed in an in vitro process (e.g., by mutagenesis) or in an in vivo
process (e.g., by utilizing the gene splicing ability of a host organism). It is appreciated that 
in many instances the introduction of these nucleotides may also be desirable for many other 
reasons in addition to the potential benefit of creating a serviceable demarcation point. 
[000442] The synthetic ligation reassembly method of the invention utilizes a plurality
of nucleic acid building blocks, each of which preferably has two ligatable ends. The two
ligatable ends on each nucleic acid building block may be two blunt ends (i.e. each having an
overhang of zero nucleotides), or preferably one blunt end and one overhang, or more
preferably still two overhangs. On a double-stranded nucleic acid, a useful overhang can be a
3' overhang, or a 5' overhang. A nucleic acid building block can have a 3' overhang, a 5'
overhang, two 3' overhangs, or two 5' overhangs. The overall order in which the nucleic
acid building blocks are assembled to form a finalized chimeric nucleic acid molecule is
determined by purposeful experimental design (e.g., by designing sticky ends between
building block nucleic acids based on the sequence of the 5' and 3' overhangs) and is not
random.
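
How designed overhangs fix the assembly order can be sketched as follows. This is a simplified model, not the method of the invention: each junction is represented by a single label (here the top-strand overhang sequence), and two ends are treated as compatible when they carry the same label. The block names and overhang sequences are hypothetical.

```python
def assembly_order(blocks):
    """Return block names in the order dictated by their junction labels.

    Each block is a dict with 'name', 'left' and 'right'. A right end joins
    the left end that carries the same label; None marks a terminal end.
    """
    by_left = {}
    for block in blocks:
        if block["left"] in by_left:
            raise ValueError(f"junction {block['left']!r} is not unique")
        by_left[block["left"]] = block

    order = []
    current = by_left.get(None)  # the block whose left end is terminal
    while current is not None and len(order) < len(blocks):
        order.append(current["name"])
        right = current["right"]
        current = by_left.get(right) if right is not None else None

    if len(order) != len(blocks):
        raise ValueError("blocks do not form one complete chain")
    return order

# Blocks supplied in arbitrary order; the junction design alone fixes the result
blocks = [
    {"name": "middle", "left": "ACTG", "right": "TCCA"},
    {"name": "last", "left": "TCCA", "right": None},
    {"name": "first", "left": None, "right": "ACTG"},
]
print(assembly_order(blocks))
```

Because every junction label is unique, only one chain is possible, which is the sense in which the order is set by design rather than at random.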

[000443] According to one preferred aspect, a nucleic acid building block is generated
by chemical synthesis of two single-stranded nucleic acids (also referred to as
single-stranded oligos) and contacting them together under hybridization conditions so as to allow
them to anneal to form a double-stranded nucleic acid building block. A double-stranded
nucleic acid building block can be of variable size. The sizes of these building blocks can be
small or large. Preferred sizes for building blocks range from 1 base pair (not including any
overhangs) to 100,000 base pairs (not including any overhangs). Other preferred size ranges
are also provided, which have lower limits of from 1 bp to 10,000 bp (including every integer
value in between), and upper limits of from 2 bp to 100,000 bp (including every integer
value in between).

[000444] According to one aspect, a double-stranded nucleic acid building block is
generated by first generating two single stranded nucleic acids and allowing them to anneal to
form a double-stranded nucleic acid building block. The two strands of a double-stranded
nucleic acid building block may be complementary at every nucleotide apart from any that
form an overhang; thus containing no mismatches, apart from any overhang(s). According to
another aspect, the two strands of a double-stranded nucleic acid building block are
complementary at fewer than every nucleotide apart from any that form an overhang. Thus,
according to this aspect, a double-stranded nucleic acid building block can be used to
introduce codon degeneracy. Preferably the codon degeneracy is introduced using the
site-saturation mutagenesis described herein, using one or more N,N,G/T cassettes or
alternatively using one or more N,N,N cassettes.
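
The practical difference between the N,N,G/T and N,N,N cassettes can be checked by counting codons against the standard genetic code. The sketch below builds the standard codon table from the usual TCAG ordering; the function and variable names are illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): AMINO_ACIDS[i]
               for i, codon in enumerate(product(BASES, repeat=3))}

def cassette_stats(third_position_bases):
    """Count codons, distinct amino acids and stop codons for a cassette
    with N at positions 1 and 2 and the given bases at position 3."""
    codons = [a + b + c
              for a, b in product(BASES, repeat=2)
              for c in third_position_bases]
    amino_acids = {CODON_TABLE[c] for c in codons} - {"*"}
    stops = sum(1 for c in codons if CODON_TABLE[c] == "*")
    return len(codons), len(amino_acids), stops

nn_gt = cassette_stats("GT")    # N,N,G/T cassette
nnn = cassette_stats("TCAG")    # N,N,N cassette
print("N,N,G/T:", nn_gt)
print("N,N,N:  ", nnn)
```

The N,N,G/T cassette still reaches all 20 amino acids while halving the number of codons and leaving a single stop codon (TAG), which reduces the screening burden per saturated position.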

Example 15: Assays for Evaluation of Nitrilase Activity and Enantioselectivity 
[000445] An assay method amenable to high throughput automation to increase the
screening throughput both of the discovery and evolution efforts for nitrilases is described. 
The ideal assay is one that permits quantification of both product formation or substrate 
conversion and also enantiomeric excess. Two achiral and two chiral colorimetric assays that 
are amenable to high throughput screening were developed. 

Achiral Colorimetric Assays Developed:

[000446] OPA assay for residual substrate. The OPA assay is applicable to α-amino or
α-hydroxy nitrile substrates. The lysis of whole cells is not necessary. These results were
corroborated by HPLC for 2-chloromandelonitrile and phenyl acetaldehyde cyanohydrin.
The assay works best with aromatic nitriles. Aliphatic compounds exhibit a linear standard
curve, but their fluorescence is reduced, which reduces the efficacy of the assay.

[000447] LDH Assay for quantification and ee determination of hydroxyacid formed.
The LDH assay is applicable to phenyl lactic acid but not to 2-chloromandelic acid. Use of a 
resazurin detection system increases sensitivity and reduces background. Background 
fluorescence of whole cells was overcome either by centrifugation or heat inactivation prior 
to performing assay. 

[000448] AAO Assay for quantification and ee determination of aminoacid formed. The 
AAO assay is applicable to phenylalanine and (S)-2-amino-6-hydroxy hexanoic acid. The use 
of the Amplex Red detection system increases sensitivity. Cell lysis was shown not to be
necessary. Cells are grown in defined media in order to prevent background fluorescence. 
OPA Assay 

[000449] The o-phthalaldehyde (OPA) fluorescence based nitrilase assay is used to
quantify the amount of α-hydroxynitrile substrate remaining. OPA reacts with the cyanide
released from the pH controlled decomposition of α-hydroxynitriles to the corresponding
aldehyde and cyanide to yield a fluorescent, quantifiable product, the
1-cyano-2-R benzoisoindole.
[000450] Standard curves were established for the following substrates:
2-Chloromandelonitrile (CMN, 0.998), Cyclohexylmandelonitrile (CHMN, 0.99),
Acetophenone aminonitrile (APA, 0.99), and Phenylacetaldehyde cyanohydrin (PAC, 0.97)
(Figure 5; R² values in parentheses). A standard curve for Phenylglycine (PGN, 0.93) was
also established. Three of the substrates tested, Dimethylbutanal aminonitrile (DMB)
(2-amino-4,4-dimethyl pentanenitrile), Hydroxypivaldehyde aminonitrile (HPA) and
Pivaldehyde aminonitrile (PAH), gave very low fluorescence readings and unreliable results
under the original assay conditions. For these compounds a number of parameters were
adjusted; however, the fluorescent signal strength of these compounds was not increased by
these manipulations.
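
The R² values quoted for the standard curves come from a linear fit of fluorescence against known substrate concentration. A minimal sketch of such a fit is given below; the concentrations and fluorescence readings are invented for illustration and are not data from this document.

```python
def linear_fit(x, y):
    """Ordinary least-squares line y = slope * x + intercept, with R^2."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    ss_res = sum((yi - (slope * xi + intercept)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return slope, intercept, 1.0 - ss_res / ss_tot

def concentration_from_signal(signal, slope, intercept):
    """Invert the standard curve to estimate residual substrate."""
    return (signal - intercept) / slope

# Hypothetical standard curve: nitrile concentration (mM) vs fluorescence (RFU)
conc_mM = [0.0, 2.0, 4.0, 6.0, 8.0, 10.0]
rfu = [5.0, 205.0, 410.0, 590.0, 805.0, 1000.0]

slope, intercept, r_squared = linear_fit(conc_mM, rfu)
print(f"slope={slope:.1f} RFU/mM, intercept={intercept:.1f} RFU, R^2={r_squared:.4f}")
print(f"{concentration_from_signal(500.0, slope, intercept):.2f} mM remaining")
```

A low slope, as reported for the aliphatic substrates, makes the inverted estimate sensitive to noise even when R² is acceptable.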

[000451] In an attempt to increase the fluorescent signal of these three compounds,
naphthalene dicarboxaldehyde (NDA) was substituted for OPA. Standard curves for PAH,
HPA and DMB with either OPA or NDA were constructed. To determine sensitivity and
background fluorescence, a lyophilized nitrilase lysate (SEQ ID NOS:189, 190) with
suspected catalytic activity on each of the substrates was added. Hydrolysis was detected in
three out of four of the compounds. NDA sharply boosted the signal, often by an order of
magnitude, though linearity was reduced, presumably due to signal saturation.
[000452] NDA was established as an alternative detection reagent for the aliphatic
compounds. However, it is desirable for the assay to utilize the same detection system for all
of the substrates since this would facilitate the automated evaluation of multiple nitrilase
substrates. The current OPA based assay is effective for the analysis of PAC, CMN, CHMN,
APA, MN and PGN, while standard curves have been developed for the aliphatic
compounds PAH, HPA, and DMB.

Whole cell optimization

[000453] The effect of addition of lyophilized nitrilase lysate to the assay components, 
either untreated or heat inactivated, was evaluated. Interfering background fluorescence was 
not observed in either case. The OPA assay was next evaluated and optimized for nitrilase 
activity detection in a whole cell format. Both nitrilase expressing whole cells and in-situ
lysed cells were evaluated. Lyophilized cell lysates were evaluated alongside their respective
whole cell clones as controls. For this optimization study, mandelonitrile (MN) was chosen as 
a model substrate. 

[000454] The lyophilized cell lysate of SEQ ID NOS:187, 188 was evaluated alongside
whole cells expressing SEQ ID NOS:187, 188 and in situ lysed cells expressing SEQ ID
NOS:187, 188. The addition of whole cells did not affect fluorescence nor result in
fluorescence quenching. Addition of any of the three cell lysis solutions improved
permeability (and therefore conversion) of mandelonitrile in the whole cell systems. Three
cell lysing solutions were evaluated, B-PER (Pierce), BugBuster (Novagen) and CelLytic
B-II (Sigma), and were found not to have a deleterious effect on the OPA assay. The addition of
product α-hydroxyacid or α-aminoacid did not affect detection by the OPA assay.

[000455] The assay was modified from its original format, which required several liquid 
transfer steps, into a one plate process, where cell growth, nitrile hydrolysis and OPA assay 
reaction occurred in the same microtiter plate. Mandelonitrile was tested using this single 
well format. In this case, the E. coli Gene Site-Saturation Mutagenesis (GSSM™) cell host
was evaluated. Three clones were tested: SEQ ID NOS:101, 102, SEQ ID NOS:187, 188,
and an empty vector, which was used as a control. Hydrolysis was evaluated at four
timepoints, at 10 and 20 mM, and also with a 0 mM control. In an earlier experiment, clone
SEQ ID NOS:187, 188 was evaluated against the phenylacetaldehyde cyanohydrin substrate
(for which this enzyme does not exhibit activity), and no activity was observed. 
[000456] The OPA assay was found to detect the presence of both α-hydroxy and
α-amino nitrile substrate. Aromatic compounds were readily detectable with the assay, while
aliphatic compounds posed some detection challenges. No background issues were evident
when using lyophilized cell lysates, in-situ lysed whole cells or unlysed whole cells. The
assay is amenable to one-plate analysis, where cells are grown, incubated with the substrate,
and assayed on the same plate: no liquid transfers are required, easing automation. While all
nitriles tested produced a linear response, aliphatic compounds gave a low fluorescent
response.
Chiral LDH Assay 

[000457] A spectroscopic system based on lactate dehydrogenase (L-LDH) was 
developed for the analysis of the chiral α-hydroxy acids which are generated by the nitrilase
catalyzed hydrolysis of cyanohydrins. The hydroxynitrile substrate is not metabolized by the 
secondary or detection enzyme and thus starting material does not interfere. Cell lysate 
which is not heat treated results in background activity for the LDH system; however, heat 
inactivation or pelleting of the cell lysates eliminates the background activity. (See Figure 
4.) 

[000458] The activity and enantiomeric specificity of commercially available D- and L- 
lactate dehydrogenases against the nitrilases disclosed herein was evaluated. An LDH was 
identified which is suitable to both D- and L-phenyl lactic acid analysis. An enzyme suitable 
for 2-Chloromandelic acid analysis was not found. The chosen LDH enzymes exhibited 
virtually absolute stereoselectivity. The viability of the assay to detect D- and L-LDH 
produced from PAC using lyophilized cell lysate was established. 
[000459] Originally, three colorimetric dyes were evaluated, all of which are
tetrazolium salts: NBT ((3,3'-dimethoxy-4,4'-biphenylene)bis[2-(4-nitrophenyl)-5-phenyl-2H-tetrazolium
chloride]), MTT (3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyltetrazolium bromide) and INT
(2-(4-iodophenyl)-3-(4-nitrophenyl)-5-phenyl-2H-tetrazolium chloride). The insolubility of
the product of these detection systems posed an analytical challenge. To address this, another
tetrazolium salt with a reportedly soluble product, XTT
(2,3-bis-(2-methoxy-4-nitro-5-sulfophenyl)-2H-tetrazolium-5-carboxanilide), was evaluated. While XTT yielded a soluble
bright red product, the substrate was insoluble, which posed the same analytical
challenges. As an alternative to the tetrazolium family of dyes, the dual
colorimetric/fluorometric dye resazurin was evaluated. Reduction of resazurin produces
resorufin. Both substrate and product are soluble, and the color change can be quantified
colorimetrically or fluorimetrically, increasing accuracy. Due to the sensitivity of resazurin,
0.05 mM of lactic acid can be quantified. Optimal results were obtained when using the dye
in the same range as the substrate, e.g. 0.5 mM resazurin can quantify a range of lactic acid (and
analogs) from 0.05 to 0.5 mM, though the best linearity is at the lower end of this scale. Resorufin
was stable over 28 hours, and had a linear fluorescent response.

[000460] In the presence of the LDH assay components, lyophilized enzyme gave 
background fluorescence/absorption. To address this problem the lysate was boiled for 10 
minutes and then centrifuged. This resulted in a 90% decrease in background signal. 
Interestingly, either centrifugation alone (5 minutes @ 14.1 rcf) or boiling (5 minutes @
100°C) followed by centrifugation reduced the fluorescence to background levels. In a
high-throughput format such as 1536 well plates, spinning would be preferable to boiling, as
boiling would increase evaporation (8 µl well size) and potentially volatilize the nitrile
substrates. No background signal resulting from growth media (LB and TB and M9) or cell 
lytic solutions (B-PER, CelLytic and BugBuster) was noted. 



Chiral AAO Assay 

[000461] A spectroscopic system based on amino acid oxidase (AAO) was developed 
for the analysis of the chiral α-amino acids which are generated by the nitrilase catalyzed
hydrolysis of amino nitriles. 
Assay Development and Validation 

[000462] The initial assay validation utilized the
2,2'-azino-di-{3-ethylbenzothiazoline-6-sulfonic acid} (ABTS) detection system as outlined above. However, since the color was not
stable, further investigations utilized the phenol amino antipyrine (PAAP) detection system,
which is analyzed at λmax 510 nm. Enzymes with suitable activity were found for each
enantiomer of 4-methyl-leucine, phenylalanine, (S)-2-amino-6-hydroxy hexanoic acid, and
tert-leucine. The assay is not applicable to methylphenylglycine and does not work well with
phenylglycine.

[000463] Standard curves were generated for phenylalanine from 0-15 mM. The curve
is much more linear when the concentrations remain below 1 mM. The color remains
stable for several days as long as it is kept in the dark. Three cell lysing solutions, Bug Buster
(BB), Bacterial Protein Extracting Reagent (BPER), and Cell Lytic Reagent (CLR), were
added to the standard curve and shown to have no effect on color development. The addition
of cell lysate (cl) did not exhibit background color formation. Addition of the
phenylacetaldehyde aminonitrile sulfate (PAS) starting material also showed no effect on
color formation.

[000464] The AAO system exhibits greater linearity at up to 1 mM substrate. The 
concentration of the AAO enzymes and of the acid substrate were adjusted to try to move the 
intersection of the L-AAO and D-AAO curves closer to the middle of the graph. Premixing 
the PAAP, the HRP, and the AAO was demonstrated to be effective and caused no change in 
observed activity establishing that the assay components may be added to the assay in a 
cocktail format. 

[000465] A high level of background was observed for the AAO assay of whole cells 
and this was attributed to the L-amino acids present in the TB and LB growth media. 
Washing and resuspension of the cells in M9 media eliminated background. For all future 
experiments cells were grown in M9 media with 0.2% glucose. The lysed cells gave only a
slightly better response than unlysed cells. Therefore, cell lysis is not necessary. SEQ ID
NOS:187, 188 demonstrated activity on HPA in primary screening based on HPLC analysis.
[000466] The use of a fluorescent detection system which would permit
implementation of the assay in ultra high throughput fashion such as 1536 well or gigamatrix
format was investigated. The fluorescent reagent most applicable to our system is Amplex
Red from Molecular Probes, which produces the highly fluorescent resorufin (λex 545 nm; λem
590 nm). Standard curves for phenylalanine and (S)-2-amino-6-hydroxy hexanoic acid were
established (0-100 µM).

[000467] In preparation for assay automation, nitrilase expressing cells were added into
microtiter plates containing M9 0.2% glucose, 0.25 mM IPTG media by fluorescence activated
cell sorting (FACS). Three nitrilase expressing subclones, and the empty vector control were 
evaluated: SEQ ID NOS:101, 102, SEQ ID NOS:187, 188, SEQ ID NOS:29, 30 and the 
empty vector. The viability of the cells following cell sorting proved to be inconsistent. Thus 
colony picking is currently being evaluated as an alternative method to add cells into 
microtiter plates. The evaporative loss from an uncovered 1536-well microtiter plate is 
approximately 30% per day in the robot incubator (incubator conditions: 37°C at 85% 
relative humidity (RH)). Incubation in the 95% RH incubator reduced evaporative loss to 1% 
per day. 
[000468] The ability of the three subclones to grow in the presence of up to 3.5 mM of
nitrile was established using HPA nitrile. Growth rates were only slightly retarded (less than
30%). Subclones grown in the presence of HPA were shown to express a nitrilase that
catalyzes the formation of hydroxy norleucine (HNL) as established using the Amplex Red
detection system. Only S was evaluated as the enzymes are S-selective. The reaction plate
was read at 10 minute intervals, with 40 minutes showing the best linearity. While cell
growth is significantly inhibited above 5 mM of HPA when the cells were grown at pH 7,
growth was inhibited above 0.1 mM HPA for cells grown at pH 8.

[000469] In order to verify the AAO results by HPLC, a reaction was performed using 
high concentrations of HPA, up to 40 mM (due to HPLC detection challenges for
(S)-2-amino-6-hydroxy hexanoic acid) and lyophilized cell lysate SEQ ID NOS:187, 188.



Comparison of AAO and HPLC data for HNL

89%   89%   97%   29%   36%
20    86%   97%   21%   34%
10    78%   98%   13%   35%



[000470] In order to determine if conducting the screen at a lower concentration 
introduces a bias in the results compared to the 20 mM substrate range that was used for 
HPLC based screens, an experiment was performed with SEQ ID NOS:187, 188 using three 
concentration ranges. Each experiment was done in triplicate in order to remove any 
nonsystematic error. 

[000471] The AAO assay can be run on 384 or 1536 well format with cells sorted into
an M9 0.2% glucose, 0.25 mM IPTG media. Cells can be grown in the presence of nitrile (in
this case HPA), or the cells can be allowed to reach a certain density and the nitrile can then
be added. Though cell lytic reagents do not interfere with the assay, when HPA was assayed,
addition of the lytic reagents was found to be unnecessary. Either pre- or post-nitrile
addition, the mother plate will have to be split into daughter plates, which are then assayed
for the respective L- and D-enantiomer content. Incubation times with the AAO/Amplex
Red reagents can be adjusted so that the D- and L-plates are read at separate times.
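
Once the L- and D-daughter plates have been read and converted to amounts via their standard curves, the enantiomeric excess follows from the standard definition, ee = (L - D) / (L + D) × 100. A minimal sketch, with illustrative values rather than data from this document, and with a sign convention (positive means L excess) chosen here for convenience:

```python
def enantiomeric_excess(l_amount, d_amount):
    """Percent ee from the amounts of L- and D-enantiomer.

    Positive values indicate an excess of L, negative values an excess of D.
    Both amounts must be in the same unit (e.g. µM from the standard curve).
    """
    total = l_amount + d_amount
    if total <= 0:
        raise ValueError("no product detected; ee is undefined")
    return 100.0 * (l_amount - d_amount) / total

# Hypothetical readings for one well, already converted to µM
ee_percent = enantiomeric_excess(l_amount=95.0, d_amount=5.0)
print(f"ee = {ee_percent:.1f}% (L)")
```

Wells with no detectable product on either plate are flagged instead of being reported as 0% ee, since a racemic result and an inactive clone are different outcomes.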
Example 16: Identification, Development and Production of Robust, Novel Enzymes
Targeted for a Series of High-Value Enantioselective Bioprocesses 
[000472] The invention provides for the development of nitrilases, through directed 
evolution, which provide significant technical and commercial advantages for the process 
manufacturing of the following chemical target:



L-2-amino-6,6-dimethoxyhexanoic acid

[Reaction scheme: 5,5-dimethoxypentanal (HCN, NH4+) → 5,5-dimethoxypentanal aminonitrile → L-2-amino-6,6-dimethoxyhexanoic acid]

[000473] Nitrilase enzymes were shown to hydrolyze 2-amino-6-hydroxy hexanenitrile to
(S)-2-amino-6-hydroxy hexanoic acid, with selectivity towards the L-enantiomer. The panel of
nitrilases was screened against the target, 5,5-dimethoxypentanal aminonitrile. The positive
enzymes were characterized on this substrate. A primary screen is used to identify putative
up-mutants, which are then confirmed using HPLC.

[000474] GSSM™ and GeneReassembly™ are performed on selected nitrilases, in 
order to improve the enantioselectivity and activity of the enzymes for the production of
L-2-amino-6,6-dimethoxyhexanoic acid. Nitrilases were identified for the enantioselective
hydrolysis of 2-amino-6-hydroxy hexanenitrile to L-(S)-2-amino-6-hydroxy hexanoic acid.
However, a slight structural difference is presented by the new target molecule, L-2-amino- 
6,6-dimethoxyhexanoic acid. In order to determine whether this difference affects the 
activity and enantioselectivity of the enzymes, the complete spectrum of nitrilases was 
screened against the new target. 

[000475] First, identification of the correct target gene for GSSM through more detailed
characterization of the hit enzymes for the production of L-2-amino-6,6-dimethoxyhexanoic
acid was carried out. This effort involves a more extensive investigation of the effects of pH
and temperature on activity and enantioselectivity and a more in-depth analysis of the
stability of the enzyme to process conditions. Prior to initiation of the screening, the
synthesis of a single enantiomer of an alkyl aminonitrile is done; the racemization of this
nitrile is studied, in an effort to understand the relationship between this factor and
enantioselectivity of the enzymes.

An enzyme exhibiting the highest combination of activity and enantioselectivity for
the production of L-2-amino-6,6-dimethoxyhexanoic acid is selected for GSSM. Following
the mutation of the target enzyme, the resulting mutants are screened on 5,5-
dimethoxypentanal aminonitrile, using high throughput screening technology. Following
confirmation of the up-mutants by HPLC analysis, a decision point is reached, in order to
evaluate the results of the GSSM on the target.

[000476] In parallel to GSSM™, a GeneReassembly™ is performed on a combination
of parent enzymes, at least one of which is selected for activity and enantioselectivity on L-2-
amino-6,6-dimethoxyhexanoic acid. At least two other nitrilases are reassembled with the
former enzyme(s); these enzymes are selected in order to provide diversity to the reassembled
sequences.

[000477] The present invention provides for development of racemization conditions for
the original substrate aminonitriles. In addition, the present invention provides for the
identification of enzymes capable of the conversion of these aminonitriles to the target α-
amino acids by dynamic kinetic resolution. The present invention also provides for
screening and development of a nitrilase-catalyzed kinetic resolution process for (R)-2-
amino-6,6-dimethoxy hexanoic acid (allysine) production. (S)-2-amino-6-hydroxy hexanoic
acid will be used as a model substrate for development of the kinetic resolution.
The target α-amino acid products are shown below:

(i) D-4-Fluorophenylglycine

[Reaction scheme: 4-fluorobenzaldehyde; 4-fluorophenylglycinonitrile (FPGN); D-4-fluorophenylglycine]

(ii) L-2-Amino-6,6-dimethoxyhexanoic acid (Allysine)

[Reaction scheme: 5,5-dimethoxypentanal; (HCN, ammonia source); 5,5-dimethoxypentanal
aminonitrile (DMPAN); (nitrilase); L-2-amino-6,6-dimethoxyhexanoic acid]

[000478] Conditions are developed for the racemization of the aminonitrile substrates
for the nitrilase-catalyzed production of D-4-fluorophenylglycine and 2-amino-4,4-dimethyl
pentanenitrile (allysine). Two model substrates, phenylglycinonitrile and pentanal
aminonitrile, are used initially, and racemization is studied in the absence of the enzyme.
Concurrently, the performance of one or more available nitrilases is determined under a
variety of possible racemization conditions. In addition, the nitrilases are
screened against hydroxypentanal aminonitrile for the production of (S)-2-amino-6-hydroxy
hexanoic acid, and the promising enzymes are optimized. Once racemization conditions are
established, the nitrilases are screened for activity. Further optimization for a kinetic
resolution of the product is performed.

[000479] A number of enantioselective nitrilases were identified for the hydrolysis of α-
aminonitriles to α-amino acids. While these enzymes were shown to have a preference for
the required enantiomer of certain aminonitriles, a limiting factor in the further screening,
development and comparison of candidate nitrilases is the rate of racemization of the
aminonitrile substrates under the reaction conditions.

Aromatic aminonitrile racemization 

[000480] The first step is to establish conditions under which aromatic aminonitrile
racemization occurs, using the model substrate, phenylglycinonitrile. Racemization strategies
include, but are not limited to, the list below. Options are roughly prioritized according to
their commercial applicability.

(1) Manipulation of the pH of the reaction. Since it has been shown that racemization is rapid at high
pH, this approach requires the discovery and optimization of nitrilases which are active and selective
at pH>10.

(2) Addition of known chemical racemizing agents, such as aldehydes, ketones, weak bases, resins,
metal ions, Lewis acids etc., which can enhance racemization at lower pH.

(3) Synthesis of N-acylated aminonitrile derivatives, e.g. N-acetyl phenylglycinonitrile, which may be
more easily racemized. In the case of N-acetyl phenylglycinonitrile, a selective D-acylase which
removes the acetyl group would enhance the optical purity of the nitrilase product.

(4) Use of a biphasic system in which base-catalyzed racemization occurs in the hydrophobic organic
phase and enzymatic hydrolysis in the aqueous phase.

(5) Use of a 2-enzyme system comprised of a nitrilase and an aminonitrile racemase. One amino acid
racemase is commercially available at present, and will be tested for activity against phenyl- and
fluorophenylglycinonitrile. Gene libraries will be searched for genes showing homology to known
amino acid amide racemases, hydantoin racemases or any other racemases which can be identified.

[000481] Once conditions for this racemization have been established, they provide the
basis for development of conditions for racemization of the target aromatic substrate, 4-
fluorophenylglycinonitrile (FPGN). The FPGN is expected to be less stable than the model
substrate; thus, it may racemize more quickly, but degradation reactions may be faster as
well. The ability of sample enzyme(s) to tolerate and/or function well under these conditions is
evaluated. Final optimization of screening methods includes the target substrates, sample
nitrilases, and substrate racemization conditions.

[000482] Investigations carried out have shown that phenylglycinonitrile is easily
racemized at pH 10.8. However, it does not appear that any of the existing enzymes can
tolerate such harsh conditions of pH. Samples from highly alkaline environments are
screened for the presence of nitrilases which are tolerant to such conditions. Once
discovered, the enzymes are sequenced and subcloned, and the enzymes are produced as
lyophilized cell lysates ready for screening.

Aliphatic aminonitrile racemization

[000483] A model aliphatic aminonitrile, pentanal aminonitrile, is synthesized in its
racemic form. However, optically enriched samples are prepared using one of the following
approaches: (i) preparative chiral HPLC; (ii) diastereomeric salt resolution; (iii)
diastereomeric derivatization or column chromatography; (iv) synthesis from L-N-BOC
norleucine. An HPLC assay is used for the detection of these compounds.

HPLC Assay

[000484] An HPLC assay involving pre-column derivatization is used for the detection of
(S)-2-amino-6-hydroxy hexanoic acid.

Screening/Characterization:

[000485] Nitrilases are screened against 2-amino-6-hydroxy hexanenitrile. For enzymes
capable of performing well at greater than 25 mM substrate, scale up reactions are performed.
The substrate/product tolerance and stability profiles of the other enzymes are investigated.

[000486] The nitrilases are screened, and hits are characterized, focusing on pH and
temperature optimum, enantioselectivity and stability under the reaction conditions.

Enzyme Evolution

[000487] A target enzyme exhibiting the desired properties is selected for GSSM™.
Following the mutation of the target enzyme, the resulting mutants are screened on the
substrate using high throughput screening technology. Once the up-mutants have been
confirmed by HPLC analysis, the individual mutations responsible for increased performance
may be combined and evaluated for possible additive or synergistic effects.

[000488] In addition, a GeneReassembly™ will be performed on a combination of lead
enzymes, which are selected for their desirable characteristics, including activity,
enantioselectivity and stability in the reaction.

Example 17: Optimization of Nitrilases for the Enantioselective
Production of (S)-Phenyllactic Acid

[000489] Nitrilases were identified for the enantioselective hydrolysis of 5 different nitrile 
substrates. These nitrilases were isolated and optimized for selected targets. The optimization 
involves process optimization and directed evolution. In particular, enzymes specific for the 
production of (S)-phenyllactic acid were characterized and optimized. This was aimed 
primarily at improving the activity of the enzymes, while maintaining a high 
enantioselectivity. An investigation into the effects of process conditions on the enzymes 
was also performed.

[Reaction scheme: phenylacetaldehyde; (HCN); phenylacetaldehyde cyanohydrin; (S)-phenyllactic acid]



[000490] The development of high throughput assays for screening of mutants from
potential directed evolution efforts was accomplished. Two achiral and two chiral 
colorimetric assays that are amenable to high throughput screening were developed and used 
for nitrilase directed evolution. 

[000491] SEQ ID NOS:103, 104 was identified as a highly enantioselective nitrilase for
the production of (S)-phenyllactic acid. Characterization of SEQ ID NOS:103, 104 shows
the optimum reaction pH and temperature to be pH 8 and 37°C, respectively; the reaction
starting material, phenylacetaldehyde, and the product, phenyllactic acid, showed no effect on
the enzyme activity up to levels of 5 mM and 30 mM, respectively. The scaled-up enzymatic
reaction gave product with an enantiomeric excess (ee) of 95%.

Example 18: Directed Evolution of a nucleic acid encoding a nitrilase enzyme.

[000492] The nitB gene (GenBank Accession No. AX025996, from Alcaligenes
faecalis) was subjected to Gene Site Saturated Mutagenesis™ or GSSM™ to generate a
library of single amino acid substitution mutants covering the entire enzyme. The sequence
of the "parental" nitB gene used in the directed evolution is SEQ ID NO: 103, 104. A nitB
mutant library was generated by carrying out GSSM™. This nitB mutant library was then
screened for clones with increased whole cell hydroxymethylthiobutyronitrile (HMTBN,
which is a nitrilase substrate) activity. The product of the nitrilase reaction on that substrate
is hydroxymethylthiobutyric acid (HMTBA).
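
As a rough illustration of what a single-site saturation library of this kind covers, the sketch below enumerates the 32 codons of an NNK degenerate set (N = A/C/G/T, K = G/T) and checks which amino acids they encode. The choice of NNK is an assumption made for illustration only; this passage does not state which degenerate codon was used, and the 356-residue protein length in the last line is a hypothetical value.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def nnk_codons():
    """All codons matching NNK: any base, any base, then G or T."""
    return [a + b + c for a in "ACGT" for b in "ACGT" for c in "GT"]


def single_substitution_count(protein_length):
    """Number of distinct single amino acid substitutions (19 per position)."""
    return 19 * protein_length


codons = nnk_codons()
encoded = {CODON_TABLE[c] for c in codons}
amino_acids = sorted(encoded - {"*"})
stop_codons = [c for c in codons if CODON_TABLE[c] == "*"]

print(len(codons), "codons encode", len(amino_acids), "amino acids; stops:", stop_codons)
print(single_substitution_count(356))  # hypothetical 356-residue enzyme
```

Under this assumption every position can be changed to each of the other 19 amino acids, which is what "covering the entire enzyme" requires of the library.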

[000493] Assays were run at 35°C with 100 mM HMTBN and 100 mM K3PO4, pH 7, to
approximately 30-40% conversion. Two methods were used to quantitate HMTBN
conversion, one being direct measurement of HMTBS produced by HPLC analysis and the
other being indirect detection of residual HMTBN using the fluorescent cyanide assay, which
has previously been described.

[000494] Putative nitB up mutants were subjected to a secondary assay to confirm the
increased activity. In the secondary assay, up mutants and the wild type control were induced
in expression medium in shake flasks. Shake flask cultures were then washed with 100 mM
K3PO4, pH 7, and resuspended to the same optical density at 660 nm. Kinetic assays were then
performed with the normalized cell resuspensions under the same conditions used in the
initial assays. Putative up mutants confirmed to have increased HMTBN activity were
sequenced and tested for increased activity after transformation back into the same expression
strain to ensure that increases in activity were not due to host mutations.

[000495] A confirmed nitB GSSM™ up-mutant is nitB G46P, which contains a glycine
(GGT) to proline (CCG) substitution at amino acid 46. The whole cell HMTBN activity of
this mutant is approximately 50% greater than that of wild type NitB at both 25°C and 35°C.
Upon identification of the beneficial G46P mutation, GSSM™ was used again to generate a
pool of double mutants using the nitB G46P template. These mutants all contain the G46P
mutation and an additional single amino acid substitution at a random site. The double
mutants were assayed for HMTBN activity greater than that of nitB G46P. Double, triple and
quadruple mutants were created in order to speed up the mutation process and identify
beneficial mutations more quickly. After the first few beneficial mutations were identified
and isolated, they were combined to generate double mutants, the best of which was DM18.
DM18 was used as a template to generate triple mutants. The most active triple mutant was
TM3 and that was used as a template to generate quadruple mutants. The most active
quadruple mutant was QM2. The table summarizes these mutations.



mutant   mutation 1           mutation 2            mutation 3            mutation 4
DM18     R (gcg) 29 C (tgt)   Y (tac) 207 M (atg)
TM3      R (gcg) 29 C (tgt)   Y (tac) 207 M (atg)   L (ctt) 170 T (act)
QM2      R (gcg) 29 C (tgt)   Y (tac) 207 M (atg)   L (ctt) 170 T (act)   A (gcg) 197 N (aat)

[000496] The mutants were characterized first by studying their whole cell HMTBN
activity. At 100 mM HMTBN, the HMTBS production rate of QM2 is 1.2 times greater than
that of the parental gene. However, at 200 mM HMTBN, the rate of QM2 is 3.6 times that of
the parental gene. The productivity of these mutants is increased considerably when the
HMTBN concentration is raised from 100 mM to 300 mM. As to conversion rates, TM3
completely converted the substrate after 270 minutes and both DM18 and SM show greater
than 75% conversion after this time. To further address the issue of HMTBN concentration
effects on activity/productivity of NitB, several mutants were assayed at both 400 mM and
528 mM HMTBN. NitB is essentially inactive at these substrate concentrations; however, the
mutants retain significant activity at these concentrations. In particular, the activity at these
high concentrations was essentially the same as their activity at 200 mM substrate.
Therefore, the mutants can be used over a wide substrate concentration range and provide
much more flexibility in utility than the NitB parental gene.

[000497] The mutants were shown to have higher expression levels than the parental
gene and it also appeared that the QM2 and TM3 mutants contained a greater proportion of
soluble enzyme than the wild type as seen in SDS-PAGE analysis. As to stability, all of the
enzymes showed essentially the same stability pattern at both 25°C and 35°C.

[000498] Finally, the mutants were subjected to codon-optimization. The approach was
to optimize the codons and therefore increase the expression levels in the particular host cell.
That would, in turn, increase the activity per cell of the enzyme. This resulted in increased
whole cell activity in the codon-optimized mutants as compared to controls; activity was
approximately doubled. An E. coli expression system was used.

Example 19: Selected Examples of Compounds Produced From a Nitrilase-Catalyzed
Reaction

[000499] The compounds listed in Figure 15 are selected compounds that can be produced
from a nitrilase-catalyzed reaction using an enzyme and/or a method of the invention.

[000500] In addition, the following are potential products which can be made via the 
nitrilase Strecker format. More than 100 amino acids and many new drugs can be produced 
from their respective aldehydes or ketones utilizing the nitrilase enzymes of the invention. 
For example, large market drugs which can be synthesized using nitrilases of the invention 
include homophenylalanine, VASOTEC™, VASOTERIC™, TECZEM™, PRINIVIL™, 
PRINZIDE™, ZESTRIL™, ZESTORETIC™, RAMACE™, TARKA™, MAVIK™, 
TRANDOAPRIL™, TRANDOLAPRILAT™, ALTACE™, ODRIK™, UNIRETIC™, 
LOTENSIN™, LOTREL™, CAPOTEN™, MONOPREL™, TANATRIL™, ACECOL™, 
LONGES™, SPIRAPRIL™, QUINAPRIL™, and CILAZAPRIL™. Other chiral drugs 
include DEMSER™ (alpha-methyl-L-Tyrosine), ALDOCHLOR™, LEVOTHROID™,
SYNTHROID™, CYTOMEL™, THYOLAR™, HYCODAN™, CUPRIMINE™, 
DEPEN™, PRIMAXIN™, MIGRANOL™, D.H.E.-45, DIOVAN™, CEFOBID™, L- 
DOPA, D-DOPA, D-alpha-methyl-DOPA, L-alpha-methyl-DOPA, L-gamma- 
hydroxyglutamate, D-gamma-hydroxyglutamate, 3-(2-naphthyl)-L-alanine, D-homoserine, 
and L-homoserine. 

[000501] Furthermore, the nitrilase enzymes of the invention can be useful for synthesizing
the following amino acids, many of which have pharmaceutical applications: D-
phenylglycine, L-phenylglycine, D-hydroxyphenylglycine, L-hydroxyphenylglycine, L-
tertiary leucine, D-tertiary leucine, D-isoleucine, L-isoleucine, D-norleucine, L-norleucine,
D-norvaline, L-norvaline, D-2-thienylglycine, L-2-thienylglycine, L-2-aminobutyrate, D-2-
aminobutyrate, D-cycloleucine, L-cycloleucine, D-2-methylphenylglycine, L-2-
methylphenylglycine, L-thienylalanine, and D-thienylalanine.

[000502] The nitrilase enzymes of the invention can be useful for the
synthesis of the following natural amino acids: glycine, L-alanine, L-valine, L-leucine, L-
isoleucine, L-phenylalanine, L-tyrosine, L-tryptophan, L-cysteine, L-methionine, L-serine, D-
serine, L-threonine, L-lysine, L-arginine, L-histidine, L-aspartate, L-glutamate, L-asparagine,
L-glutamine, and L-proline. The following are examples of unnatural amino acids which can
be produced using the nitrilase enzymes of the invention: D-alanine, D-valine, D-leucine, D-
isoleucine, D-phenylalanine, D-tyrosine, D-tryptophan, D-cysteine, D-methionine, D-
threonine, D-lysine, D-arginine, D-histidine, D-aspartate, D-glutamate, D-asparagine, D-
glutamine, and D-proline.

[000503] Furthermore, nitrilase enzymes of the invention can be used in non-Strecker
chemical reactions including the synthesis of more chiral drugs such as TAXOTERE™ as
well as chiral drugs containing 3-hydroxy-glutaronitrile (a $5.5B market): LIPITOR™,
BAYCOL™, and LESCOL™. Chiral product targets that are not drugs include
PANTENOL™, L-phosphinothricin, D-phosphinothricin, D-fluorophenylalanine, and L-
fluorophenylalanine. Finally, nitrilase can be used to produce unnatural amino acid
compounds lacking a chiral center such as sarcosine, iminodiacetic acid, EDTA, alpha-
aminobutyrate, and beta-alanine.

Figure 16 shows examples of substrates and products produced by the nitrilases of the
invention and/or the methods of the invention. The chemical structures of the substrates and
of the products are shown. The chemical reactions shown here are non-limiting examples of
activities of the nitrilases of the invention.

Example 20: Exemplary Preparation Using a Polypeptide of a Variant of SEQ ID NO:210

[000504] The variant, nitrilase 1506-83-H7A, is SEQ ID NO:210 with the Ala at residue
190 replaced with His. At the codon level, the mutation that occurred was GCT to CAT.
This variant exhibits improved enantioselectivity in the conversion of 3-
hydroxyglutarylnitrile (HGN) to (R)-4-cyano-3-hydroxybutyrate.

[000505] This variant has been demonstrated to perform this transformation in 100 mM pH
7 sodium phosphate buffer at room temperature. This mutant can perform in other buffer
systems and temperatures as well with the potential for providing additional altered
properties. Exemplary properties include, but are not limited to, altered rates of the reaction,
% ee, and stability. In particular, the altered properties can be a higher reaction rate, a higher
% ee, and greater stability. Altered properties can be an increase or decrease of at least 25%,
30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% more than
wildtype.

[000506] This variant was shown to perform the transformation, producing products in
high enantiomeric excess, at 10 mM to 3 M substrate (HGN). Higher or lower substrate
concentrations are also possible. Enantiomeric excesses greater than or equal to 95% have been
achieved. However, enantiomeric excess can be at least 25%, 30%, 35%, 40%, 45%, 50%,
55%, 60%, 65%, 70%, 75%, 80%, 85%, or 90% more than wildtype.

[000507] Variants of the SEQ ID NOs: of the invention can be cloned into expression
vectors. For example, variants of nucleic acid sequence SEQ ID NO: 195, 205, 207, 209, or
237, and nucleotides that encode the variants of amino acid sequence SEQ ID NO:210 can
be cloned into exemplary vectors that include, but are not limited to, pSE420 (E. coli
expression vector) and pMYC (Pseudomonas expression vector).

Example 21: Preparation using variants of the invention:

[000508] Add 3-Hydroxyglutaronitrile (1 g, 9 mmol) drop-wise to a stirred solution of
nitrilase cell lysate (normalized for 150 mg protein content) in 2.12 mL of 100 mM pH 7
sodium phosphate buffer at room temperature, ~22 °C. Stir this 3 M reaction by magnetic stir
bar for 24 hours at room temperature. Monitor the progress of the reaction by TLC (Thin
Layer Chromatography) and GC (Gas Chromatography). The reaction should be complete
within 24 hours.
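
The quantities in the preparation above can be cross-checked with a short calculation. The sketch below is illustrative only: the molar mass of 3-hydroxyglutaronitrile (C5H6N2O, about 110.12 g/mol) is computed from standard atomic weights rather than taken from the text, and the function names are hypothetical.

```python
# Molar mass of 3-hydroxyglutaronitrile, C5H6N2O, from standard atomic weights
# (assumed value, not stated in the text).
HGN_MOLAR_MASS = 110.12  # g/mol


def mmol_from_mass(mass_g, molar_mass):
    """Convert a mass in grams to millimoles."""
    return 1000.0 * mass_g / molar_mass


def total_volume_ml(substrate_mmol, target_molar):
    """Total reaction volume (mL) needed to reach a target molarity (mol/L)."""
    return substrate_mmol / target_molar  # mmol / (mol/L) = mL


substrate_mmol = mmol_from_mass(1.0, HGN_MOLAR_MASS)  # the text rounds this to 9 mmol
volume_ml = total_volume_ml(9.0, 3.0)                 # total volume for a 3 M reaction
remaining_ml = volume_ml - 2.12                       # volume not accounted for by buffer

print(round(substrate_mmol, 2), volume_ml, round(remaining_ml, 2))
```

On these numbers, 9 mmol at 3 M implies a total volume of about 3 mL, so the 2.12 mL of buffer leaves roughly 0.88 mL for the neat substrate and lysate.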

[000509] Other variants contemplated herein include, but are not limited to, the following:
N111S; A190H, S, Y or T; F191L, V, M, D, G, E, Y or T; M199E or L; D222L; A55K, G, or
Q; I60E; or any combination thereof.
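
The residue numbers of these variants relate to the nucleotide positions recited in the claims (for example 568-570 for residue 190) by simple codon arithmetic. The following sketch shows that mapping and translates a codon change into variant notation using the standard genetic code. It assumes that nucleotide numbering starts at the first base of the coding sequence; the helper names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def codon_positions(residue):
    """Return the (first, last) 1-based nucleotide positions of a residue's codon."""
    return 3 * residue - 2, 3 * residue


def describe(residue, parent_codon, mutant_codon):
    """Format a codon change as variant notation, e.g. A190H."""
    first, last = codon_positions(residue)
    parent_aa = CODON_TABLE[parent_codon.upper()]
    mutant_aa = CODON_TABLE[mutant_codon.upper()]
    return f"{parent_aa}{residue}{mutant_aa} (nt {first}-{last}: {parent_codon}->{mutant_codon})"


# The GCT to CAT change at residue 190 described in Example 20.
print(describe(190, "GCT", "CAT"))

# Residues named in the variant list and their codon positions.
for residue in (55, 60, 111, 190, 191, 199, 222):
    print(residue, codon_positions(residue))
```

With this numbering, residues 55, 60, 111, 190, 191, 199 and 222 fall at nucleotides 163-165, 178-180, 331-333, 568-570, 571-573, 595-597 and 664-666, respectively.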

Example 22: Screening Assay for Enantioselective Transformation 

[000510] A new method is disclosed to screen for enantioselective transformation, for
example, of a prochiral substrate into a chiral one, that affords the ability to monitor the
enantiomeric excess (% ee) of the resultant product. This approach can also be applied to
determine diastereomeric excess (% de).

[000511] For example, by labeling one of the two prochiral or enantiotopic moieties in a
molecule, for example by the use of a heavier or lighter isotope, the modification of one of the
two moieties by a selective catalyst, for example, an enzyme, can be established by mass
spectroscopy (MS).

[000512] By performing the exemplary nitrilase reaction on 15N-(R)-HGN (as shown in
Figure 17) or 15N-(S)-HGN, one can determine the enantioselectivity of the enzyme by
analyzing the amount of each of the two possible labeled versus unlabeled acid products
which can be formed.
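
The formula masses (FM) quoted in the accompanying schemes, 111.05 for the labeled nitrile and 129.04 or 130.04 for the acid products, are consistent with monoisotopic masses of the neutral molecules. The sketch below reproduces them; the molecular formulas (C5H6N2O for HGN, C5H7NO3 for 4-cyano-3-hydroxybutyric acid) and the isotope masses are standard values supplied here as assumptions, not figures taken from the text.

```python
# Monoisotopic masses in unified atomic mass units (standard reference values).
ISOTOPE_MASS = {
    "C": 12.0,
    "H": 1.00782503,
    "N": 14.00307401,
    "15N": 15.00010890,
    "O": 15.99491462,
}


def monoisotopic_mass(formula):
    """Sum isotope masses for a formula given as {isotope symbol: count}."""
    return sum(ISOTOPE_MASS[symbol] * count for symbol, count in formula.items())


# 3-Hydroxyglutaronitrile carrying one 15N-labeled nitrile group.
hgn_15n = {"C": 5, "H": 6, "N": 1, "15N": 1, "O": 1}
# 4-Cyano-3-hydroxybutyric acid when the unlabeled nitrile remains in the product.
light_acid = {"C": 5, "H": 7, "N": 1, "O": 3}
# 4-Cyano-3-hydroxybutyric acid when the 15N-labeled nitrile remains in the product.
heavy_acid = {"C": 5, "H": 7, "15N": 1, "O": 3}

for name, formula in [("15N-HGN", hgn_15n), ("light acid", light_acid), ("heavy acid", heavy_acid)]:
    print(name, round(monoisotopic_mass(formula), 2))
```

Which acid is formed depends on which of the two nitrile groups the enzyme hydrolyzes, so the one-unit mass difference reports on the enantiotopic selectivity.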



[Reaction scheme: 15N-(R)-hydroxyglutaryl nitrile (15N-(R)-HGN), FM = 111.05; proS selective
nitrilase; (S)-4-cyano-3-hydroxy butyric acid, FM = 130.04]

[000513] The screening experiment may be performed in either direction. The screening
experiment can be used for both the 15N-(R)- and 15N-(S)-HGN moieties. In fact, to ensure that
the label does not effect any artifactual changes, at the onset, both should be investigated.



[Reaction scheme: labeled HGN, FM = 111.05; proR selective nitrilase: 15N-labeled
(R)-4-cyano-3-hydroxy butyric acid, FM = 130.04; proS selective nitrilase:
(S)-4-cyano-3-hydroxy butyric acid, FM = 129.04]



[000514] To calculate the observed enantiomeric excess resulting from the nitrilase
transformation, the following exemplary formula may be applied:

% ee = {[130] - [129]} / {[130] + [129]}, where the concentrations of the light acid
(129) and the heavy acid (130) are determined by correlation of the peak area on the mass
spectrometer to a standard curve or by direct comparison of the areas of each of the 129 and
130 mass peaks. The actual mass units used to determine the relative amounts of each of the
two enantiomers (labeled and unlabelled) are dependent on how the mass spectrometer is
tuned.
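
A minimal sketch of this calculation follows. It applies the formula above to the heavy (130) and light (129) acid amounts and multiplies by 100 to express the result as a percentage. The linear standard-curve helper, its slope and intercept, and the example peak areas are hypothetical and serve only to show the two routes described (standard curve or direct comparison of peak areas).

```python
def concentration_from_area(peak_area, slope, intercept=0.0):
    """Convert a peak area to a concentration using a linear standard curve.

    Assumes area = slope * concentration + intercept (hypothetical calibration).
    """
    return (peak_area - intercept) / slope


def percent_ee(heavy, light):
    """Enantiomeric excess in percent from heavy (130) and light (129) amounts."""
    total = heavy + light
    if total <= 0:
        raise ValueError("total amount of acid must be positive")
    return 100.0 * (heavy - light) / total


# Direct comparison of peak areas (hypothetical values).
ee_direct = percent_ee(heavy=97.5, light=2.5)

# Via standard curves with hypothetical slopes for each mass channel.
heavy_conc = concentration_from_area(peak_area=1950.0, slope=20.0)
light_conc = concentration_from_area(peak_area=50.0, slope=20.0)
ee_curve = percent_ee(heavy_conc, light_conc)

print(ee_direct, ee_curve)
```

A negative result simply means that the light acid predominates, that is, the enzyme favors the opposite enantiotopic nitrile for the labeled substrate used.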

[000515] In some cases, the % ee observed by mass spectrometry may differ by a factor
from that observed by an alternate analytical technique such as liquid chromatography due to
background or contaminating peaks resulting from natural isotopic abundance. This does not,
however, affect the final outcome of the screening process. Exemplary standard curves for
quantification of heavy acid and light acid are shown in Figures 14 A and B.

[000516] The following reaction is a possible synthetic route to prepare, for example, 15N-
(R)-HGN using chemistry techniques known in the art with commercially available starting
materials.

[Reaction scheme: synthetic route to 15N-(R)-HGN using KC15N]

[000517] The amount of each of the two possible stereomeric outcomes can be established
by the use of MS in either positive or negative mode, and from analysis of either
the parent mass or any fragment mass.

Example 23: Stability and activity of exemplary enzymes of the invention

Enzyme stability:

Wild-type enzyme (SEQ ID NOS:209 and 210) was compared to mutant A190H of SEQ ID
NOS:209 and 210. In the experiment, each enzyme was incubated at 10 mg/ml in water for
1, 25, 50, 75 and 150 hours at 4°C and at 21°C, on two different substrates: adiponitrile and
hydroxyglutaryl nitrile. Both enzymes, in all conditions, were found to retain activity for 150
hours. The wild-type enzyme showed greater activity on adiponitrile, while the mutated
(A190H) enzyme showed greater activity on hydroxyglutaryl nitrile, as assessed by the
Nitroprusside Berthelot assay (see, e.g., Fawcett, J. K. & Scott, J. (1960); J. Clin. Path.; Vol.
13, pg 156).



GSSM™ Variant of      100 mM              2.25 M              Time to
SEQ ID NOS:209        hydroxyglutaryl     hydroxyglutaryl     completion
and 210               nitrile ee%         nitrile ee%         (hours)

A55G                  96.5 ± 0.4          Not determined      >160
A55K                  94.7 ± 0.2          Not determined      >160
I60E                  96.5 ± 0.5          Not determined      >160
N111S                 95.8 ± 0.5          96.1 ± 0.9          >160
A190T                 96.5 ± 0.2          96.6 ± 0.4          40
A190S                 96.8 ± 0.2          95.5 ± 0.7          40
A190H                 97.9 ± 0.1          98.1 ± 0.1          15
F191L                 97.9 ± 0.1          Not determined      >160
F191T                 97.9 ± 0.1          Not determined      >160
F191M                 97.9 ± 0.1          Not determined      >160
F191V                 97.9 ± 0.1          Not determined      >160
M199E                 97.9 ± 0.1          Not determined      160
M199L                 97.9 ± 0.1          95.4 ± 0.1          >160
Wild type SEQ ID      94.5 ± 0.1          87.8 ± 0.2          24
NOS:209 and 210

GSSM mutants with enhanced enantioselectivity



100 mM reactions were performed with nitrilase expressed from E. coli in whole cell format
and were complete within 36 hours. 2.25 M reactions were performed with nitrilase as
lyophilized clarified cell lysate. All % ee data reported are the average of three
measurements, with standard deviation of the mean. The time for reaction completion was
approximated by TLC.
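
For reference, the sketch below shows one way a triplicate % ee value and its spread could be summarized. The three measurements are hypothetical, and because "standard deviation of the mean" could refer either to the sample standard deviation or to the standard error of the mean, both are computed; which one the reported ± values represent is not stated here.

```python
import math
import statistics

# Hypothetical triplicate % ee measurements for one variant.
measurements = [97.8, 97.9, 98.0]

mean_ee = statistics.mean(measurements)
sample_sd = statistics.stdev(measurements)                 # n - 1 in the denominator
standard_error = sample_sd / math.sqrt(len(measurements))  # standard error of the mean

print(f"{mean_ee:.1f} ± {sample_sd:.1f} (SD), ± {standard_error:.2f} (SEM)")
```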

Specifically:

Nitrilase Activity Assay at 100 mM HGN:

Putative nitrilase up-mutants were assayed in triplicate. Each transformant was grown in 5
mL LB (100 µg/mL ampicillin), at 37 °C, 220 rpm for 18 h. The overnight culture was diluted
2-fold and nitrilase expression induced at 37 °C, 220 rpm with 0.1 mM IPTG for 6 h. Cells
were harvested by centrifugation, washed in 100 mM pH 7 sodium phosphate buffer and then
re-suspended in 1 mL of 100 mM HGN in 100 mM pH 7 sodium phosphate buffer. Reactions
were allowed to proceed for at least 36 h at 22 °C with gentle agitation. Reaction progress
was monitored by TLC (1:1 EtOAc:Hexanes, Rf = 0.5, nitrile; Rf ≈ 0.0, acid). Cells and other
debris were removed by centrifugation and the supernatant was treated with one volume of
methanol prior to lyophilization. The lyophilizate was re-suspended in methanol and treated
with TMS-diazomethane (10 equivalents, 2 M solution in hexanes) until gas evolution ceased and
yellow color persisted in order to prepare the methyl ester for GC analysis. Selected nitrilase
variants producing (R)-(-)-3-hydroxy-4-cyanobutyric acid of 95% ee or greater were then
evaluated for performance at 2.25 M HGN.

Nitrilase Activity Assay at 2.25 M 3-HGN:

3-HGN (0.2 g, 1.8 mmol, 3 M) was suspended in sodium phosphate buffer (0.6 mL, pH 7,
100 mM) at 22 °C. Cell lysate (6 mg, normalized for nitrilase content) was added to bring the
concentration to 11 mg/ml enzyme and the reaction shaken (100 rpm, 22 °C). Reaction
progress was monitored by TLC (1:1 ethyl acetate:hexanes, Rf = 0.32, nitrile; Rf = 0.0, acid).
The reaction mixture was treated with one part methanol prior to lyophilization. The
lyophilizate was re-suspended in methanol and treated with TMS-diazomethane
(10 equivalents, 2 M solution in hexanes) to prepare the methyl ester, which was
analyzed by GC.

Description of novel high throughput LC/MS method for screening high numbers of samples:

Ultra High-throughput Primary Chiral Activity Screen:

Distinct members of the GSSM library were arrayed into 384 well plates
containing 40 µL of Luria-Bertani (LB) medium (100 µg/mL ampicillin) via an automated
colony picker and then incubated at 37 °C, 85% humidity. Nitrilase expression was induced
with 0.1 mM IPTG at 37 °C for 24 h. Each plate was replicated and 20% glycerol stocks
prepared for archival at -80 °C. To each 384 well plate was added 10 mM 15N-(R)-1
substrate. The plates were incubated at 37 °C, 85% humidity for three days. Cells and other
debris were removed by centrifugation and the supernatant was diluted 17,576-fold prior to
MS analysis.

LC/MS ionspray was applied for high-throughput analysis in the following
manner. High-throughput screening was achieved by flow injecting samples from 384-well
plates using a CTC PAL autosampler (Leap Technologies, Carrboro, N.C.). An isocratic
mixture of 71% acetonitrile, 29% water, with 0.1% formic acid, provided by LC-10ADvp
pumps (Shimadzu, Kyoto, Japan) at 2.2 mL/min through an LC-18 cartridge (Supelco,
Bellefonte, PA) was used. Samples were applied to an API 4000 TurboIon spray triple-
quadrupole mass spectrometer (Applied Biosystems, Foster City, CA). Ion spray and
Multiple Reaction Monitoring (MRM) were performed for analytes in the negative ion mode,
and each analysis took 60 seconds.

E. coli transformed with wild type enzyme (SEQ ID NOS:209 and 210) was
used as a positive activity control and E. coli transformed with empty vector was used as the
negative activity control. The % ee of the WT enzyme positive control determined by mass
spectrometry using either 15N-(R)-1 or 15N-(S)-1 was the same, thus demonstrating the
absence of a significant isotope effect.



Temp (°C)   pH   Sodium phosphate buffer conc. (mM)   % ee   Std. Dev.
4           7    100                                  98.7   0.1 %
19          7    100                                  98.7   0.1 %
21          7    100                                  98.6   0.1 %
37          7    100                                  98.4   0.1 %
21          7    100                                  98.6   0.1 %
21          6    100                                  98.6   0.1 %
21          8    100                                  98.6   0.1 %
21          7    100                                  98.5   0.1 %
21          7    50                                   98.6   0.1 %
21          7    25                                   98.7   0.1 %

Effect of reaction parameters on SEQ ID NOS:209 and 210 with the A190H mutation.

Reactions were performed at 3 M HGN concentration with 150 mg/ml protein
(~49 mg/ml enzyme). % ee was determined by GC analysis in triplicate runs.

[000518] While the invention has been described in detail with reference to certain
preferred aspects thereof, it will be understood that modifications and variations are within
the spirit and scope of that which is described and claimed.

What is claimed is:

1. An isolated or recombinant nucleic acid comprising nucleotides having a sequence at
least 50% identical to SEQ ID NO: 195, 205, 207, 209, or 237, variants of SEQ ID 
NO:195, 205, 207, 209, or 237, having one or more mutations: at positions 163-165 
AAA, AAG, GGT, GGC, GGA, GGG, CAA, or CAG; at positions 178-180 GAA or 
GAG; at positions 331-333 TCT, TCC, TCA, TCG, AGT, or AGC; at positions 568- 
570 CAT, CAC, TCT, TCC, TCA, TCG, AGT, AGC, ACT, ACC, ACA, TCA, TAT, 
TAC, ATG or ACG; at positions 571-573 TTA, TTG, CTT, CTC, CTA, CTG, GTT,
GTC, GTA, GTG, ATG, ACT, ACC, ACA, GAT, GAC, GGT, GGC, GGA, GGG, 
GAA, GAG, TAT, TAC, or ACG; at positions 595-597 GAA, GAG, TTA, TTG, CTT, 
CTC, CTA, or CTG; at positions 664-666 TTA, TTG, CTT, CTC, CTA, or CTG; or any 
combination thereof, fragments thereof, wherein the nucleic acid or fragment encodes a 
polypeptide having nitrilase activity, or their complements. 

2. The isolated or recombinant nucleic acid of claim 1, wherein the nucleic acid comprises 
nucleotides having a sequence substantially identical to the SEQ ID NO: 195, 205, 207, 
209, or 237, or variants of SEQ ID NO: 195, 205, 207, 209, or 237, having one or more 
mutations: at positions 163-165 AAA, AAG, GGT, GGC, GGA, GGG, CAA, or CAG; 
at positions 178-180 GAA or GAG; at positions 331-333 TCT, TCC, TCA, TCG, AGT, 
or AGC; at positions 568-570 CAT, CAC, TCT, TCC, TCA, TCG, AGT, AGC, ACT, 
ACC, ACA, TCA, TAT, TAC, ATG or ACG; at positions 571-573 TTA, TTG, CTT, 
CTC, CTA, CTG, GTT, GTC, GTA, GTG, ATG, ACT, ACC, ACA, GAT, GAC, GGT, 
GGC, GGA, GGG, GAA, GAG, TAT, TAC, or ACG; at positions 595-597 GAA, 
GAG, TTA, TTG, CTT, CTC, CTA, or CTG; at positions 664-666 TTA, TTG, CTT, 
CTC, CTA, or CTG; or any combination thereof, or their complements. 

3. An isolated or recombinant nucleic acid comprising nucleotides having a sequence 
identical to the SEQ ID NO:195, 205, 207, 209, or 237, fragments having nitrilase 
activity, or their complements. 

4. An isolated or recombinant nucleic acid comprising nucleotides having a sequence 
identical to a variant of SEQ ID NO: 195, 205, 207, 209, or 237, having one or more 
mutations: at positions 163-165 AAA, AAG, GGT, GGC, GGA, GGG, CAA, or CAG;
at positions 178-180 GAA or GAG; at positions 331-333 TCT, TCC, TCA, TCG, AGT, 
or AGC; at positions 568-570 CAT, CAC, TCT, TCC, TCA, TCG, AGT, AGC, ACT, 
ACC, ACA, TCA, TAT, TAC, ATG or ACG; at positions 571-573 TTA, TTG, CTT, 
CTC, CTA, CTG, GTT, GTC, GTA, GTG, ATG, ACT, ACC, ACA, GAT, GAC, GGT,
GGC, GGA, GGG, GAA, GAG, TAT, TAC, or ACG; at positions 595-597 GAA, 
GAG, TTA, TTG, CTT, CTC, CTA, or CTG; at positions 664-666 TTA, TTG, CTT, 
CTC, CTA, or CTG; or any combination thereof, fragments having nitrilase activity, or 
their complements. 

5. An isolated or recombinant nucleic acid that hybridizes to a nucleic acid of SEQ ID 
NO:195, 205, 207, 209, or 237, or variants of SEQ ID NO:195, 205, 207, 209, or 237, 
having one or more mutations: at positions 163-165 AAA, AAG, GGT, GGC, GGA, 
GGG, CAA, or CAG; at positions 178-180 GAA or GAG; at positions 331-333 TCT,
TCC, TCA, TCG, AGT, or AGC; at positions 568-570 CAT, CAC, TCT, TCC, TCA,
TCG, AGT, AGC, ACT, ACC, ACA, TCA, TAT, TAC, ATG or ACG; at positions 
571-573 TTA, TTG, CTT, CTC, CTA, CTG, GTT, GTC, GTA, GTG, ATG, ACT, 
ACC, ACA, GAT, GAC, GGT, GGC, GGA, GGG, GAA, GAG, TAT, TAC, or ACG; 
at positions 595-597 GAA, GAG, TTA, TTG, CTT, CTC, CTA, or CTG; at positions
664-666 TTA, TTG, CTT, CTC, CTA, or CTG; or any combination thereof, fragments
having nitrilase activity, or their complements. 

6. The isolated or recombinant nucleic acid of claim 5, wherein the stringent conditions 
comprise at least 50% formamide, and about 37°C to about 42°C. 

7. A nucleic acid probe comprising from about 15 nucleotides to about 50 nucleotides, 
wherein at least 15 consecutive nucleotides are at least 50% complementary to a nucleic 
acid target region within a nucleic acid sequence of SEQ ID NO:195, 205, 207, 209, or 
237, or variants of SEQ ID NO: 195, 205, 207, 209, or 237, having one or more
mutations: at positions 163-165 AAA, AAG, GGT, GGC, GGA, GGG, CAA, or CAG;
at positions 178-180 GAA or GAG; at positions 331-333 TCT, TCC, TCA, TCG, AGT, 
or AGC; at positions 568-570 CAT, CAC, TCT, TCC, TCA, TCG, AGT, AGC, ACT, 
ACC, ACA, TCA, TAT, TAC, ATG or ACG; at positions 571-573 TTA, TTG, CTT, 
CTC, CTA, CTG, GTT, GTC, GTA, GTG, ATG, ACT, ACC, ACA, GAT, GAC, GGT,
GGC, GGA, GGG, GAA, GAG, TAT, TAC, or ACG; at positions 595-597 GAA, 
GAG, TTA, TTG, CTT, CTC, CTA, or CTG; at positions 664-666 TTA, TTG, CTT, 
CTC, CTA, or CTG; or any combination thereof, or their complements. 

8. A nucleic acid probe comprising at least 15 consecutive nucleotides of a nucleic acid 
target region within a nucleic acid sequence of SEQ ID NO: 195, 205, 207, 209, or 237, 
or variants of SEQ ID NO: 195, 205, 207, 209, or 237, having one or more mutations: at 
positions 163-165 AAA, AAG, GGT, GGC, GGA, GGG, CAA, or CAG; at positions
178-180 GAA or GAG; at positions 331-333 TCT, TCC, TCA, TCG, AGT, or AGC; at
positions 568-570 CAT, CAC, TCT, TCC, TCA, TCG, AGT, AGC, ACT, ACC, ACA,
TCA, TAT, TAC, ATG or ACG; at positions 571-573 TTA, TTG, CTT, CTC, CTA,
CTG, GTT, GTC, GTA, GTG, ATG, ACT, ACC, ACA, GAT, GAC, GGT, GGC,
GGA, GGG, GAA, GAG, TAT, TAC, or ACG; at positions 595-597 GAA, GAG, TTA,
TTG, CTT, CTC, CTA, or CTG; at positions 664-666 TTA, TTG, CTT, CTC, CTA, or
CTG; or any combination thereof, or their complements. 

9. A nucleic acid vector capable of replication in a host cell, wherein the vector comprises 
the nucleic acid of any one of claims 1 to 6, 12, or 13. 

10. A host cell comprising the nucleic acid of any one of claims 1 to 6, 12 or 13. 

11. A host organism comprising the host cell of claim 10.

12. An isolated or recombinant nucleic acid encoding a polypeptide comprising amino acids
having a sequence at least 50% identical to SEQ ID NO: 196, 206, 208, 210 or 238, or 
variants of SEQ ID NO: 196, 206, 208, 210 or 238, having one or more mutations: at 
residue 55 lysine, glycine, or glutamine; at residue 60 glutamic acid; at residue 111 
serine, at residue 190, serine, histidine, tyrosine or threonine; at residue 191, leucine,
valine, methionine, aspartic acid, glycine, glutamic acid, tyrosine or threonine; at
residue 199 glutamic acid or leucine; at residue 222 leucine; or any combination 
thereof, fragments encoding polypeptides wherein the polypeptides have nitrilase 
activity, or its complement. 

13. An isolated or recombinant nucleic acid encoding a polypeptide comprising amino acids
having a sequence of SEQ ID NO: 196, 206, 208, 210 or 238, or variants of SEQ ID 
NO: 196, 206, 208, 210 or 238, having one or more mutations: at residue 55 lysine, 
glycine, or glutamine; at residue 60 glutamic acid; at residue 111 serine, at residue 190, 
serine, histidine, tyrosine or threonine; at residue 191, leucine, valine, methionine, 
aspartic acid, glycine, glutamic acid, tyrosine or threonine; at residue 199 glutamic acid 
or leucine; at residue 222 leucine; or any combination thereof, fragments encoding
polypeptides having nitrilase activity, or its complement.

14. The isolated or recombinant nucleic acid of any one of claims 1 to 6, 12 or 13, wherein 
the nucleic acid is affixed to a solid support. 

15. The isolated or recombinant nucleic acid of claim 14, wherein the solid support is 
selected from the group of a gel, a resin, a polymer, a ceramic, a glass, a microelectrode 
and any combination thereof.

16. An isolated or recombinant polypeptide comprising amino acids having a sequence at 
least 50% identical to SEQ ID NO:196, 206, 208, 210 or 238, or variants of SEQ ID 
NO: 196, 206, 208, 210 or 238, having one or more mutations: at residue 55 lysine, 
glycine, or glutamine; at residue 60 glutamic acid; at residue 111 serine, at residue 190, 
serine, histidine, tyrosine or threonine; at residue 191, leucine, valine, methionine, 
aspartic acid, glycine, glutamic acid, tyrosine or threonine; at residue 199 glutamic acid 
or leucine; at residue 222 leucine; or any combination thereof, or fragments thereof, 
wherein the polypeptide has a nitrilase activity. 

17. An isolated or recombinant polypeptide comprising amino acid having SEQ ID 
NO:196, 206, 208, 210 or 238, or variants of SEQ ID NO:196, 206, 208, 210 or 238, 
having one or more mutations: at residue 55 lysine, glycine, or glutamine; at residue 60 
glutamic acid; at residue 111 serine, at residue 190, serine, histidine, tyrosine or 
threonine; at residue 191, leucine, valine, methionine, aspartic acid, glycine, glutamic 
acid, tyrosine or threonine; at residue 199 glutamic acid or leucine; at residue 222 
leucine; or any combination thereof, or fragments thereof, wherein the polypeptide has
nitrilase activity. 



18. The isolated or recombinant polypeptide of any one of claims 16 or 17, wherein the 
fragment is at least 20 amino acids in length, and wherein the fragment has nitrilase
activity. 

19. A peptidomimetic of the polypeptide of claim 16 or claim 17 or fragments thereof 
having a nitrilase activity.

20. A codon-optimized polypeptide of the polypeptide of claim 16 or claim 17, or 
fragments thereof, having a nitrilase activity, wherein the codon usage is optimized for 
a particular organism or cell. 

21. The polypeptide of claim 16 or claim 17 or fragments thereof, having a nitrilase
activity, or a peptidomimetic thereof having a nitrilase activity, wherein the 
polypeptide, fragment, or peptidomimetic is affixed to a solid support. 

22. The polypeptide of claim 21, wherein the solid support is selected from the group 
consisting of a gel, a resin, a polymer, a ceramic, a glass, a microelectrode and any
combination thereof. 

23. A purified antibody that specifically binds to the polypeptide of claim 16 or claim 17 or 
fragments thereof, having a nitrilase activity. 

24. A fragment of the antibody of claim 23, wherein the fragment specifically binds to a 
polypeptide having a nitrilase activity. 

25. An enzyme preparation which comprises at least one of the polypeptides of any one of 
claims 16 and 17, wherein the preparation is liquid or dry.

26. The enzyme preparation of claim 25, wherein the preparation is affixed to a solid 
support. 

27. A composition comprising at least one nucleic acid of claims 1 to 6, 12, or 13 or
comprising at least one polypeptide of claim 16 or claim 17 or fragments thereof, or a 
peptidomimetic thereof, having nitrilase activity, or any combination thereof. 

28. A method for hydrolyzing a nitrile to a carboxylic acid comprising contacting the 
molecule with at least one polypeptide of claim 16 or claim 17 or fragments thereof, or 
a peptidomimetic thereof, having nitrilase activity, under conditions suitable for 
nitrilase activity. 

29. A method for hydrolyzing a cyanohydrin moiety or an aminonitrile moiety of a 
molecule, the method comprising contacting the molecule with at least one polypeptide 
of any one of claim 16 or claim 17 or fragments thereof, or a peptidomimetic thereof, 
having nitrilase activity, under conditions suitable for nitrilase activity. 

30. A method for making a chiral alpha-hydroxy acid molecule or a chiral amino acid 
molecule, the method comprising admixing a molecule having a cyanohydrin moiety or 
an aminonitrile moiety with at least one polypeptide having an amino acid sequence at 
least claim 16 or claim 17 or fragments thereof, or a peptidomimetic thereof, having
enantio-selective nitrilase activity.

31. A method for making a composition or an intermediate thereof, the method comprising 
admixing a precursor of the composition or intermediate, wherein the precursor 
comprises a cyanohydrin moiety or an aminonitrile moiety, with at least one
polypeptide of any one of claim 16 or claim 17 or fragments thereof or peptidomimetic
thereof having nitrilase activity, hydrolyzing the cyanohydrin or the aminonitrile moiety 
in the precursor thereby making the composition or the intermediate thereof. 

32. A method for making an (R)-ethyl 4-cyano-3-hydroxybutyric acid, the method
comprising contacting a hydroxyglutaryl nitrile with at least one polypeptide encoded
by a nucleic acid having a sequence of SEQ ID NO:195, 205, 207, 209, or 237, or a
variant of SEQ ID NO: 195, 205, 207, 209, or 237, having one or more mutations: at
positions 163-165 AAA, AAG, GGT, GGC, GGA, GGG, CAA, or CAG; at positions
178-180 GAA or GAG; at positions 331-333 TCT, TCC, TCA, TCG, AGT, or AGC; at
positions 568-570 CAT, CAC, TCT, TCC, TCA, TCG, AGT, AGC, ACT, ACC, ACA,
TCA, TAT, TAC, ATG or ACG; at positions 571-573 TTA, TTG, CTT, CTC, CTA,
CTG, GTT, GTC, GTA, GTG, ATG, ACT, ACC, ACA, GAT, GAC, GGT, GGC,
GGA, GGG, GAA, GAG, TAT, TAC, or ACG; at positions 595-597 GAA, GAG, TTA,
TTG, CTT, CTC, CTA, or CTG; at positions 664-666 TTA, TTG, CTT, CTC, CTA, or
CTG; or any combination thereof, or a fragment thereof encoding a polypeptide having
nitrilase activity, that selectively produces an (R)-enantiomer, so as to make (R)-ethyl
4-cyano-3-hydroxybutyric acid.

33. A method for making an (S)-ethyl 4-cyano-3-hydroxybutyric acid, the method
comprising contacting a hydroxyglutaryl nitrile with at least one polypeptide having an
amino acid sequence of any one of SEQ ID NO:196, 206, 208, 210 or 238, or a variant
of SEQ ID NO:196, 206, 208, 210 or 238, having one or more mutations: at residue 55
lysine, glycine, or glutamine; at residue 60 glutamic acid; at residue 111 serine, at
residue 190, serine, histidine, tyrosine or threonine; at residue 191, leucine, valine,
methionine, aspartic acid, glycine, glutamic acid, tyrosine or threonine; at residue 199
glutamic acid or leucine; at residue 222 leucine; or any combination thereof, or a
fragment or peptidomimetic thereof having nitrilase activity that selectively produces an
(S)-enantiomer, so as to make (S)-ethyl 4-cyano-3-hydroxybutyric acid.

34. A method for making an (R)-mandelic acid, the method comprising admixing a
mandelonitrile with at least one polypeptide having an amino acid sequence of any one
of SEQ ID NO:196, 206, 208, 210 or 238, or a variant of SEQ ID NO:196, 206, 208,
210 or 238, having one or more mutations: at residue 55 lysine, glycine, or glutamine;
at residue 60 glutamic acid; at residue 111 serine, at residue 190, serine, histidine,
tyrosine or threonine; at residue 191, leucine, valine, methionine, aspartic acid, glycine,
glutamic acid, tyrosine or threonine; at residue 199 glutamic acid or leucine; at residue
222 leucine; or any combination thereof, or any fragment or peptidomimetic thereof
having nitrilase activity.

35. A method for making an (S)-mandelic acid, the method comprising admixing a
mandelonitrile with at least one polypeptide having an amino acid sequence of SEQ ID
NO:196, 206, 208, 210 or 238, or a variant of SEQ ID NO: 196, 206, 208, 210 or 238,
having one or more mutations: at residue 55 lysine, glycine, or glutamine; at residue 60 
glutamic acid; at residue 111 serine, at residue 190, serine, histidine, tyrosine or 
threonine; at residue 191, leucine, valine, methionine, aspartic acid, glycine, glutamic 
acid, tyrosine or threonine; at residue 199 glutamic acid or leucine; at residue 222 
leucine; or any combination thereof, or any fragment or peptidomimetic thereof having 
nitrilase activity. 

36. A method for making an (S)-phenyl lactic acid derivative or an (R)-phenyl lactic acid
derivative, the method comprising admixing a phenyllactocyanonitrile with at least one 
polypeptide selected from SEQ ID NO: 196, 206, 208, 210 or 238, or a variant of SEQ 
ID NO:196, 206, 208, 210 or 238, having one or more mutations: at residue 55 lysine, 
glycine, or glutamine; at residue 60 glutamic acid; at residue 111 serine, at residue 190, 
serine, histidine, tyrosine or threonine; at residue 191, leucine, valine, methionine, 
aspartic acid, glycine, glutamic acid, tyrosine or threonine; at residue 199 glutamic acid 
or leucine; at residue 222 leucine; or any combination thereof, or any fragment or 
peptidomimetic thereof having nitrilase activity, or any active fragment or 
peptidomimetic thereof that selectively produces an (S)-enantiomer or an
(R)-enantiomer, thereby producing an (S)-phenyl lactic acid derivative or an (R)-phenyl
lactic acid derivative. 

37. A method for making the polypeptide of claim 16 or claim 17 or fragments thereof, the 
method comprising 

(a) introducing a nucleic acid encoding the polypeptide into a host cell under conditions 
that permit production of the polypeptide by the host cell, and 

(b) recovering the polypeptide so produced. 

38. A method for generating a nucleic acid variant encoding a polypeptide having nitrilase 
activity, wherein the variant has an altered biological activity from that which naturally 
occurs, the method comprising 

(a) modifying the nucleic acid of any one of claims 1 to 6, 12, or 13 by 

(i) substituting one or more nucleotides for a different nucleotide, wherein 
the nucleotide comprises a natural or non-natural nucleotide; 

(ii) deleting one or more nucleotides, 

(iii) adding one or more nucleotides, or 

(iv) any combination thereof. 



39. A method for making a polynucleotide from two or more nucleic acids, the method 
comprising: 

(a) identifying regions of identity and regions of diversity between two or more nucleic 
acids, wherein at least one of the nucleic acids comprises a nucleic acid of any one of 
claims 1 to 6, 12, or 13; 

(b) providing a set of oligonucleotides which correspond in sequence to at least two of the 
two or more nucleic acids; and, 

(c) extending the oligonucleotides with a polymerase, thereby making the polynucleotide. 

40. A screening assay for identifying a nitrilase, the assay comprising: 

(a) providing a plurality of nucleic acids or polypeptides comprising at least one of the 
nucleic acids of any one of claims 1 to 6, 12, or 13, or at least one of the polypeptides of 
claim 16 or claim 17 or fragments thereof; 

(b) obtaining polypeptide candidates to be tested for nitrilase activity from the plurality; 

(c) testing the candidates for nitrilase activity; and 

(d) identifying those polypeptide candidates which are nitrilases. 

41. A kit comprising (a) the nucleic acid of any one of claims 1 to 6, 12, or 13, or a 
fragment thereof encoding a polypeptide having nitrilase activity, or (b) the polypeptide 
of any one of claim 16 or claim 17 or fragments thereof, or a peptidomimetic thereof 
having nitrilase activity, or a combination thereof; and (c) a buffer. 



42. A method for modifying a molecule comprising: 

(a) mixing a polypeptide of any one of claim 16 or claim 17 or fragments thereof, or 
peptidomimetic thereof having nitrilase activity, with a starting molecule to produce a 
reaction mixture; 

(b) reacting the starting molecule with the polypeptide to produce the modified 
molecule. 

43. A method for identifying a modified compound comprising:

(a) admixing a polypeptide of any one of claim 16 or claim 17 or fragments thereof, 
or peptidomimetic thereof having nitrilase activity, with a starting compound to produce a 
reaction mixture and thereafter a library of modified starting compounds; 
(b) testing the library to determine whether a modified starting compound is present
within the library which exhibits a desired activity; 

(c) identifying the modified compound exhibiting the desired activity. 

44. A computer readable medium having stored thereon at least one nucleotide sequence 
selected from the group consisting of: SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21,
23, 25, 27, 29, 31, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 
71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 
113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 
147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179,
181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213,
215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 
249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 
283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 
317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349,
351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383,
385, or variants thereof, and/or at least one amino acid sequence selected from the 
group consisting of: SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 
34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 
80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118,
120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152,
154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 
188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 
222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 
256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288,
290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322,
324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 
358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384 and 386, and 
variants thereof. 

45. A computer system comprising a processor and a data storage device, wherein the data
storage device has stored thereon at least one nucleotide sequence selected from the 
group consisting of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31,
35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79,
81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 
121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 
155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 
189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221,
223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255,
257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 
291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 
325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 
359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, and variants
thereof and/or at least one amino acid sequence selected from the group consisting of:
SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 
44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 
90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 
128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160,
162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194,
196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 
230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 
264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 
298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330,
332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364,
366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, and variants thereof. 

46. A method for identifying a feature in a sequence which comprises: 

(a) inputting the sequence into a computer; 
(b) running a sequence feature identification program on the computer so as to
identify a feature within the sequence; and
(c) identifying the feature in the sequence.
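
Steps (a) to (c) of claim 46 can be pictured with a minimal sketch. The claim does not say which program or which kind of feature is meant, so the example below assumes the feature is a forward-strand open reading frame (ATG to the first in-frame stop codon). The function name `find_orfs`, the `min_codons` parameter, and the input sequence are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq: str, min_codons: int = 3) -> list[tuple[int, int]]:
    """Return 0-based, half-open (start, end) spans of forward-strand ORFs.

    An ORF here runs from an ATG to the first in-frame stop codon, inclusive,
    and must contain at least `min_codons` codons before the stop.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))
                start = None
    return orfs


# (a) input a sequence, (b) run the feature finder, (c) report the feature.
sequence = "CCATGAAAGGTTAACC"  # hypothetical input, not from the sequence listing
for start, end in find_orfs(sequence):
    print(f"ORF at {start}-{end}: {sequence[start:end]}")
```

Other feature types, such as motifs or restriction sites, would follow the same input, scan, report pattern.
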

47. An assay for identifying a functional fragment of a polypeptide which comprises:
(a) obtaining a fragment of at least one polypeptide of claim 16 or claim 17;

(b) contacting at least one fragment from step (a) with a substrate having a 
cyanohydrin moiety or an aminonitrile moiety under reaction conditions suitable for 
nitrilase activity; 

(c) measuring the amount of reaction product produced by each at least one fragment 
from step (b); and

(d) identifying the at least one fragment which is capable of producing a nitrilase 
reaction product; thereby identifying a functional fragment of the polypeptide. 

48. An assay for identifying a functional variant of a polypeptide which comprises: 

(a) obtaining at least one variant of at least one polypeptide of claim 16 or claim 17;

(b) contacting at least one variant from step (a) with a substrate having a cyanohydrin 
moiety or an aminonitrile moiety under reaction conditions suitable for nitrilase activity; 

(c) measuring the amount of reaction product produced by each at least one variant 
from step (b); and 

(d) identifying the at least one variant which is capable of producing a nitrilase
reaction product; thereby identifying a functional variant of the polypeptide. 

49. An assay for screening enantioselective transformation comprising: 

(a) labeling one of two prochiral or enantiotopic moieties in a molecule; 

(b) modifying at least one of the two moieties by selective catalyst to produce 
products; and

(c) determining the resultant products by mass spectroscopy. 

50. The assay of claim 49, wherein the label is a heavier or lighter isotope.

51. The assay of claim 49, wherein the selective catalyst is an enzyme.

52. The assay of claim 49, wherein the use of mass spectroscopy is by a positive mode or 
a negative mode. 

53. The assay of claim 49, wherein the analysis is of either a parental mass or a
fragmentation mass. 

54. The assay of claim 49, wherein the assay can be used to monitor or determine % 
enantiomeric excess or % diastereomeric excess.
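
One way the assay of claims 49 to 54 could report % enantiomeric excess is sketched below. It assumes that the isotopically labeled and unlabeled products appear as two ions one mass unit apart, that each ion signal is converted to a concentration through its own linear standard curve, and that isotope cross-talk between the two channels is negligible. The class name, function name, slopes, intercepts, and signals are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LinearCalibration:
    """Standard curve of the form signal = slope * concentration + intercept."""
    slope: float
    intercept: float

    def concentration(self, signal: float) -> float:
        if self.slope == 0:
            raise ValueError("slope must be non-zero")
        return (signal - self.intercept) / self.slope


def percent_excess(conc_a: float, conc_b: float) -> float:
    """Return the % excess of product A over product B."""
    total = conc_a + conc_b
    if total <= 0:
        raise ValueError("total concentration must be positive")
    return 100.0 * (conc_a - conc_b) / total


# Hypothetical standard curves and ion signals for the two product masses.
light_curve = LinearCalibration(slope=1000.0, intercept=500.0)
heavy_curve = LinearCalibration(slope=1200.0, intercept=300.0)

light_conc = light_curve.concentration(99_500.0)  # unlabeled product
heavy_conc = heavy_curve.concentration(1_500.0)   # labeled product

print(f"% ee = {percent_excess(light_conc, heavy_conc):.1f}")
```

A real analysis would also need to correct for the natural isotope abundance that lets the lighter product contribute signal at the heavier mass.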

55. An isolated or recombinant polypeptide having a nitrilase activity comprising a 
sequence as set forth in SEQ ID NO: 196, 206, 208, 210 or 238 and having one or 
more mutations selected from the group consisting of a mutation at residue 55 lysine, 
residue 55 glycine, residue 55 glutamine, residue 60 glutamic acid, residue 111 serine, 
residue 190, residue 190 serine, residue 190 histidine, residue 190 tyrosine, residue 
190 threonine, residue 191 leucine, residue 191 valine, residue 191 methionine, 
residue 191 aspartic acid, residue 191 glycine, residue 191 glutamic acid, residue 191 
tyrosine, residue 191 threonine, residue 199 glutamic acid, residue 199 leucine, 
residue 222 leucine, and any combination thereof. 

56. An isolated or recombinant polypeptide having a nitrilase activity comprising a 
sequence as set forth in SEQ ID NO: 196, 206, 208, 210 or 238 and having a mutation 
at residue 190 or equivalent, wherein alanine is replaced with a hydrogen-binding 
amino acid or peptidomimetic residue. 

57. An isolated or recombinant polypeptide having a nitrilase activity comprising a 
sequence as set forth in SEQ ID NO: 196, 206, 208, 210 or 238 and having a mutation 
at residue 190 or equivalent, wherein alanine is replaced with a hydrophobic amino 
acid or peptidomimetic residue. 

58. An isolated or recombinant nitrilase having the equivalent of one or more mutations at 
residue 55 lysine, glycine, or glutamine; at residue 60 glutamic acid; at residue 111 
serine, at residue 190, serine, histidine, tyrosine or threonine; at residue 191, leucine, 
valine, methionine, aspartic acid, glycine, glutamic acid, tyrosine or threonine; at 
residue 199 glutamic acid or leucine; at residue 222 leucine of SEQ ID NO: 196, 206, 
208, 210 or 238. 

59. An amplification primer pair for amplifying a nucleic acid encoding a polypeptide
having a nitrilase activity, wherein the primer pair is capable of amplifying a nucleic 
acid comprising a sequence as set forth in claim 1, or a subsequence thereof.

60. The amplification primer pair of claim 59, wherein a member of the amplification 
primer sequence pair comprises an oligonucleotide comprising at least about 10 to 50 
consecutive bases of the sequence, or, about 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 
23, 24, 25, 26, 27, 28, 29, 30 or more consecutive bases of the sequence. 

61. A nitrilase-encoding nucleic acid generated by amplification of a polynucleotide using
an amplification primer pair as set forth in claim 59. 

62. The nitrilase-encoding nucleic acid of claim 61, wherein the amplification is by 
polymerase chain reaction (PCR).

63. The nitrilase-encoding nucleic acid of claim 62, wherein the nucleic acid is generated by
amplification of a gene library.

64. The nitrilase-encoding nucleic acid of claim 63, wherein the gene library is an 
environmental library. 

65. An isolated or recombinant nitrilase encoded by a nitrilase-encoding nucleic acid as set 
forth in claim 61. 

Figure 1

Figure 2

Figure 3

Figure 4

Figure 5

Figure 6A

FIGURE 14A

Heavy-Acid Std. Curves. Series: M- = 128 and M- = 129, each with a linear fit.
Fits as printed: y = 283620x (sign illegible) 139977, R² = 0.9643; y = 7394.4x + 14405, R² = 0.9935.

FIGURE 14B

Lite-Acid Std. Curve. Series: M- = 128 and M- = 129, each with a linear fit.
Fits as printed: y = 419156x + 115560, R² = 0.9946; y = 17819x + 9477.4, R² = 0.9937.



Figure 15. Compounds shown: D-thienyl glycine; D-thienyl alanine; D-2-amino butyrate; (R)-3-hydroxy-cyanoglutaric acid (side chain for LIPITOR™, Baycol); (S)-3-hydroxy-cyanoglutaric acid (side chain for Lescol); beta-alanine; D-methyl phenylglycine; L-methyl phenylglycine; Sarcosine.



Figure 15 (continued). Compounds shown: Pantenol; S-Plavix; EDTA; S-Ditropan; Cynomel; L-homophenyl alanine; Sarcosine; alpha-methyl-L-tyrosine; Cuprimine/Depen.



Figure 15 (continued). Compounds shown: Levothroid; 3-(2-napthyl)-L-alanine; L-isoleucine; L-norleucine; D-norleucine; L-norvaline; D-norvaline.



Figure 15 (continued). Compounds shown: methoxy-mandelic acid; 4-phenyl-2-hydroxy-3,4-butenoic acid; 1-napthyl glycolic acid; 2-pyridyl glycolic acid; 2-furanyl glycolic acid; 3-pyridyl glycolic acid; fluoro-mandelic acid; 3-furanyl glycolic acid; 2-napthyl glycolic acid; 4-pyridyl glycolic acid; chloro-mandelic acid; bromo-mandelic acid.



WO 03/097810 



Figure 15 



PCT/US03/15712 



gKJ COOH 



OH 



Z MJ COOH 



trifluomriethyphenyl lactic acid fluorophenyi lactic acid 



OH 
COOH 

Q2N 

nitro phenyl tactic acid 





c 



OH 
COOH 



methyl phenyl lactic acid 



I OH 




H(S kJ COOH 
hydroxy phenyl lactic acid 

HOOC ^OH 



OH 

"COOH y \^ "COOH 

2-hydroxy heptanoic acid 2-4iych)xy^-fTTe*hyliDentarwic acid 
OH 

^J^-^COOH 
4^Drtenyl-24iydrco<ypheriyl butancic acid pyridyl acetic add 



Or^J COOH 
brornophenyl lactic acid 



COOH 




phenyl lactic acid 



ct MJ COOH 
chlorophenyl lactic acid 



OH 

"COOH 
thienyl acetic acid 



Figure 15

3-hydroxy-cyano glutaric acid
side chain for LIPITOR, Baycol

iminodiacetic acid

3-hydroxy-cyano glutaric acid
side chain for Lescol

Pantenol

beta-alanine

S-Plavix

Sarcosine

S-Ditropan



Figure 15

Fenoprofen

Flunoxaprofen

Loxoprofen



Figure 15

alpha-methyl benzyl cyanide derivatives

intermediate for Trocade

3-methyl-2-carboxy-piperidine

Fospril



Figure 15

2-carboxy-cyclobutyl amine

2-carboxy-piperidine

2-carboxy-piperazine



Figure 16

4-bromo mandelonitrile
(R)-4-bromo mandelic acid
(S)-4-bromo mandelic acid

2-chloro mandelonitrile
(R)-2-chloro mandelic acid
(S)-2-chloro mandelic acid

4-nitro mandelonitrile
(R)-4-nitro mandelic acid
(S)-4-nitro mandelic acid

4-methyl mandelonitrile
(R)-4-methyl mandelic acid
(S)-4-methyl mandelic acid

phenyl glycinonitrile
(R)-phenyl glycine
(S)-phenyl glycine

phenyl methyl glycinonitrile
(R)-phenyl methyl glycine
(S)-phenyl methyl glycine

4-fluoro phenyl glycinonitrile
(R)-4-fluoro phenyl glycine
(S)-4-fluoro phenyl glycine



Figure 16

3-phenyl propanal cyanohydrin
(R)-2-hydroxy-4-phenyl butyric acid
(S)-2-hydroxy-4-phenyl butyric acid

2-amino hexanenitrile
(R)-2-amino-hexanoic acid
(S)-2-amino-hexanoic acid

mandelonitrile
(R)-mandelic acid
(S)-mandelic acid

2-pyridyl formaldehyde cyanohydrin
(R)-pyridyl glycolic acid
(S)-pyridyl glycolic acid



Figure 16

2-amino-6-hydroxy hexanenitrile
(R)-2-amino-6-hydroxy hexanoic acid
(S)-2-amino-6-hydroxy hexanoic acid

2-amino-4,4-dimethyl pentanenitrile
(R)-2-amino-4,4-dimethyl pentanoic acid
(S)-2-amino-4,4-dimethyl pentanoic acid

cyclohexyl mandelonitrile
(S)-cyclohexyl mandelic acid
(R)-cyclohexyl mandelic acid

2-amino-6,6-dimethoxy hexanenitrile
(R)-2-amino-6,6-dimethoxy hexanoic acid
(S)-2-amino-6,6-dimethoxy hexanoic acid

phenyl acetaldehyde cyanohydrin
(R)-phenyl lactic acid
(S)-phenyl lactic acid

hydroxyglutarylnitrile
(R)-(3)-hydroxy-4-cyano-butyric acid
(S)-(3)-hydroxy-4-cyano-butyric acid
Figure 16

(R)-2-bromo mandelic acid
(S)-2-bromo mandelic acid

2-methyl mandelonitrile
(R)-2-methyl mandelic acid
(S)-2-methyl mandelic acid

3-chloro mandelonitrile
(R)-3-chloro mandelic acid
(S)-3-chloro mandelic acid

3-bromo mandelonitrile
(R)-3-bromo mandelic acid
(S)-3-bromo mandelic acid
Figure 16

4-fluoro mandelonitrile
(R)-4-fluoro mandelic acid
(S)-4-fluoro mandelic acid

2-naphthyl formaldehyde cyanohydrin
(R)-2-naphthylglycolic acid
(S)-2-naphthylglycolic acid

1-naphthyl formaldehyde cyanohydrin
(R)-1-naphthylglycolic acid
(S)-1-naphthylglycolic acid

3-pyridyl formaldehyde cyanohydrin
(R)-3-pyridyl glycolic acid
(S)-3-pyridyl glycolic acid

3-thienyl formaldehyde cyanohydrin
(R)-3-thienyl glycolic acid
(S)-3-thienyl glycolic acid
phenyl acetaldehyde cyanohydrin
(R)-phenyl lactic acid
(S)-phenyl lactic acid

2-methyl phenyl acetaldehyde cyanohydrin
(R)-2-methyl phenyl lactic acid
(S)-2-methyl phenyl lactic acid

2-bromo phenyl acetaldehyde cyanohydrin
(R)-2-bromo phenyl lactic acid
(S)-2-bromo-phenyl lactic acid

2-fluoro phenyl acetaldehyde cyanohydrin
(R)-2-fluoro-phenyl lactic acid
(S)-2-fluoro-phenyl lactic acid

3-fluoro phenyl acetaldehyde cyanohydrin
(R)-3-fluoro-phenyl lactic acid
(S)-3-fluoro-phenyl lactic acid
3-methyl phenyl acetaldehyde cyanohydrin
(R)-3-methyl-phenyl lactic acid
(S)-3-methyl-phenyl lactic acid

1-naphthyl acetaldehyde cyanohydrin
(S)-1-naphthyl lactic acid
(R)-1-naphthyl lactic acid

3-thiophene acetaldehyde cyanohydrin
(R)-3-thiophene lactic acid
(S)-3-thiophene lactic acid

2-thiophene acetaldehyde cyanohydrin
(R)-2-thiophene lactic acid
(S)-2-thiophene lactic acid

2-pyridine acetaldehyde cyanohydrin
(R)-2-pyridine lactic acid
(S)-2-pyridine lactic acid

3-pyridine acetaldehyde cyanohydrin
(R)-3-pyridine lactic acid
(S)-3-pyridine lactic acid
proR selective nitrilase

Hydroxyglutaryl nitrile (HGN)

proS selective nitrilase

(R)-4-cyano-3-hydroxy butyric acid

(S)-4-cyano-3-hydroxy butyric acid
SEQUENCE LISTING



<110> DIVERSA CORPORATION 

<120> Nitrilases, Nucleic Acids Encoding Them and Methods for Making and 
Using Them 

<130> Docket No. 09010-120WO4 



<140> Unknown
<141> 2003-05-15



<150> US 10/241,742 
<151> 2002-09-09 

<150> US 10/146,772 
<151> 2002-05-15 

<150> US 60/309,006 
<151> 2001-07-30 

<150> US 60/351,336 
<151> 2002-01-22 

<150> US 60/300,189
<151> 2001-06-21 

<150> US 09/751,299 
<151> 2000-12-28 

<150> US 60/254,414 
<151> 2000-12-07 

<150> US 60/173,609 
<151> 1999-12-29 

<160> 386 

<170> FastSEQ for Windows Version 4.0

<210> 1 

<211> 939 

<212> DNA 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 1 

atggaaaagt atattaaagt cgccgcaatt cagatagcta caaaaatagc agattcaccc 60

gtgaatatag aaaattgcga acgtttggca ttatcggcgg tcaatgaggg tgcgcgttgg 120

attgctttgc cggagttctt caatacgggc gttagttgga acaaaaaaat tgccttggct 180

attcagacgc ctgacggcaa ggctgcgatg ttcttacgcg acttttctgc aagacatcat 240

gtattgatag gaggctcatt tctgtgcagg ttgccggatg gcagtgtgcg caaccgctat 300

atgtgttatg ccaacggcgc tctcgtgggc aaacatgaca aagacctacc cacgatgtgg 360

gaaaatgctt tttatgaagg tggggattcc agcgatattg gggtgctggg aacatttgaa 420

aatacgcgcg ttggtgcagc cgtctgttgg gagttcatgc ggacgatgac tgcccggcgt 480

cttcgcaatc aggtggatgt catcatgggt ggttcctgct ggtggagcat accgaccaat 540

ttccccggtt ttgtgcaaaa gctgtgggaa cctggaaata gccgcaacgc gcttgctgcc 600

atacaggata atgcgcgtct cattggcgtg ccggttgttc atgccgctca ttgcggtgaa 660 

attgagtgtc cgatgccagg attgccgata ggttacaggg ggttctttga gggtaacgcg 720 

gccattgtga atgcagaagg tcaggtgctt gcgcatcggg gtgctggcga gggcgaagga 780

attgtttgcg cggagatttt accggtagcc aaatcaaaca ggtcggaaat tcccaatcgt 840 

tactggttgc gctgcagagg ctttctacct atttttgcct ggcatcagca acgttggttg 900 

ggaaggcatt ggtatttgcg caatgtgcgc aggacttaa 939 
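
The <211> lengths of each DNA/protein pair in this listing are consistent with translating the full coding sequence and dropping the terminal stop codon (for example, 939 nucleotides give 313 codons, hence 312 residues). The following is a minimal sketch of that check, assuming the standard genetic code and a reading frame that starts at the first nucleotide; the helper names `translate` and `expected_protein_length` are illustrative and are not part of the listing.

```python
from itertools import product

# Standard genetic code, with codons enumerated in t, c, a, g order
# (third base varying fastest). '*' marks a stop codon.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) until the first stop codon."""
    seq = "".join(dna.lower().split())
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def expected_protein_length(dna_length: int) -> int:
    """Residue count for a coding sequence that ends in one stop codon."""
    return dna_length // 3 - 1


# First 30 nucleotides of SEQ ID NO: 1, written in the listing's 10-mer blocks.
print(translate("atggaaaagt atattaaagt cgccgcaatt"))
print(expected_protein_length(939))
```

The same length relation holds for the other pairs given here (981 and 326, 1005 and 334, 999 and 332, 945 and 314, 966 and 321).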

<210> 2 
<211> 312 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 2 



Met Glu Lys Tyr Ile Lys Val Ala Ala Ile Gln Ile Ala Thr Lys Ile
1 5 10 15

Ala Asp Ser Pro Val Asn Ile Glu Asn Cys Glu Arg Leu Ala Leu Ser
20 25 30

Ala Val Asn Glu Gly Ala Arg Trp Ile Ala Leu Pro Glu Phe Phe Asn
35 40 45

Thr Gly Val Ser Trp Asn Lys Lys Ile Ala Leu Ala Ile Gln Thr Pro
50 55 60

Asp Gly Lys Ala Ala Met Phe Leu Arg Asp Phe Ser Ala Arg His His
65 70 75 80

Val Leu Ile Gly Gly Ser Phe Leu Cys Arg Leu Pro Asp Gly Ser Val
85 90 95

Arg Asn Arg Tyr Met Cys Tyr Ala Asn Gly Ala Leu Val Gly Lys His
100 105 110

Asp Lys Asp Leu Pro Thr Met Trp Glu Asn Ala Phe Tyr Glu Gly Gly
115 120 125

Asp Ser Ser Asp Ile Gly Val Leu Gly Thr Phe Glu Asn Thr Arg Val
130 135 140

Gly Ala Ala Val Cys Trp Glu Phe Met Arg Thr Met Thr Ala Arg Arg
145 150 155 160

Leu Arg Asn Gln Val Asp Val Ile Met Gly Gly Ser Cys Trp Trp Ser
165 170 175

Ile Pro Thr Asn Phe Pro Gly Phe Val Gln Lys Leu Trp Glu Pro Gly
180 185 190

Asn Ser Arg Asn Ala Leu Ala Ala Ile Gln Asp Asn Ala Arg Leu Ile
195 200 205

Gly Val Pro Val Val His Ala Ala His Cys Gly Glu Ile Glu Cys Pro
210 215 220

Met Pro Gly Leu Pro Ile Gly Tyr Arg Gly Phe Phe Glu Gly Asn Ala
225 230 235 240

Ala Ile Val Asn Ala Glu Gly Gln Val Leu Ala His Arg Gly Ala Gly
245 250 255

Glu Gly Glu Gly Ile Val Cys Ala Glu Ile Leu Pro Val Ala Lys Ser
260 265 270

Asn Arg Ser Glu Ile Pro Asn Arg Tyr Trp Leu Arg Cys Arg Gly Phe
275 280 285

Leu Pro Ile Phe Ala Trp His Gln Gln Arg Trp Leu Gly Arg His Trp
290 295 300

Tyr Leu Arg Asn Val Arg Arg Thr
305 310



<210> 3 
<211> 981 
<212> DNA 
<213> Unknown

<220> 

<223> Obtained from an environmental sample 



<400> 3 

atggggaatt cgttcaagat cgcggtggta caagcctgtc cggtctttct ggatcgtggc 60 

gcgacagtcg ccaaggcatg ccgcctgatc gcggaggcag ccgcggcggg cgccagcctg 120 

gtggtctttc cggaggcgtt tgtgcccgga tacccactgt gggtctggtt cattccggca 180 

gggcattcgc aaccactgcg ggagttatac gccgaactgg tggggaacgc cgtggcggta 240

ccgggcgatg ccaccgatcg gctttgcgcg gcagccagag aagccggcgt ggtagtggcg 300

atcggcatca atgaagtgaa cagcgaagcc agcggcacga cgatttacaa tacgctgctg 360

tacatcggag cggacggcgc gattctgggc aaacaccgca aagtaatgcc gacgggcgga 420

gagcgcctgg tctgggcgct tggcgatggg agcgacctgg aggtctacga cctgcctttc 480

ggccgattgg gtggcctgtt gtgctgggag aactacatgc ccctggcccg gtacgcgatg 540

tcggcatggg gaaccgagat ctacgtggct ccaacttggg atcgcggaga accgtggctg 600

tccacaatgc ggcatatcgc gaaagaaggg cgatgctacg tagtgggatg ctgcagttgc 660

atgaaaattg acgatgtacc cgaccggctg gcgttcaaag ggaagtatct gtcgacggcc 720

gagggctggc tcaaccccgg cgatagcgta atcgtcgatc cggacggcaa gctgatcgcg 780

ggcccggcaa gcgagcagga gacgattctg tatgccgatg ccgaccggtc taagatcacc 840

gggcccaggt ggcagttgga tgtggccggc cactacgcgc ggccggatat cttcgaactg 900 

atcgtgcacc gcgaacctaa gcgatttttg acgatagctc cgcggacgaa ggaggagcgg 960 

gagcctgggc cggaggcctg a 981 



<210> 4 
<211> 326 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 4 



Met Gly Asn Ser Phe Lys Ile Ala Val Val Gln Ala Cys Pro Val Phe
1 5 10 15

Leu Asp Arg Gly Ala Thr Val Ala Lys Ala Cys Arg Leu Ile Ala Glu
20 25 30

Ala Ala Ala Ala Gly Ala Ser Leu Val Val Phe Pro Glu Ala Phe Val
35 40 45

Pro Gly Tyr Pro Leu Trp Val Trp Phe Ile Pro Ala Gly His Ser Gln
50 55 60

Pro Leu Arg Glu Leu Tyr Ala Glu Leu Val Gly Asn Ala Val Ala Val
65 70 75 80

Pro Gly Asp Ala Thr Asp Arg Leu Cys Ala Ala Ala Arg Glu Ala Gly
85 90 95

Val Val Val Ala Ile Gly Ile Asn Glu Val Asn Ser Glu Ala Ser Gly
100 105 110

Thr Thr Ile Tyr Asn Thr Leu Leu Tyr Ile Gly Ala Asp Gly Ala Ile
115 120 125

Leu Gly Lys His Arg Lys Val Met Pro Thr Gly Gly Glu Arg Leu Val
130 135 140

Trp Ala Leu Gly Asp Gly Ser Asp Leu Glu Val Tyr Asp Leu Pro Phe
145 150 155 160

Gly Arg Leu Gly Gly Leu Leu Cys Trp Glu Asn Tyr Met Pro Leu Ala
165 170 175

Arg Tyr Ala Met Ser Ala Trp Gly Thr Glu Ile Tyr Val Ala Pro Thr
180 185 190

Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Met Arg His Ile Ala Lys
195 200 205

Glu Gly Arg Cys Tyr Val Val Gly Cys Cys Ser Cys Met Lys Ile Asp
210 215 220

Asp Val Pro Asp Arg Leu Ala Phe Lys Gly Lys Tyr Leu Ser Thr Ala
225 230 235 240

Glu Gly Trp Leu Asn Pro Gly Asp Ser Val Ile Val Asp Pro Asp Gly
245 250 255

Lys Leu Ile Ala Gly Pro Ala Ser Glu Gln Glu Thr Ile Leu Tyr Ala
260 265 270

Asp Ala Asp Arg Ser Lys Ile Thr Gly Pro Arg Trp Gln Leu Asp Val
275 280 285

Ala Gly His Tyr Ala Arg Pro Asp Ile Phe Glu Leu Ile Val His Arg
290 295 300

Glu Pro Lys Arg Phe Leu Thr Ile Ala Pro Arg Thr Lys Glu Glu Arg
305 310 315 320

Glu Pro Gly Pro Glu Ala
325























<210> 5 
<211> 1005 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 5 

atgggtacca agcacccggc cttcaaggcc gcagtggtcc aggccgcgcc ggaatggctc 60 

gatctcgacc gcaccgtcga caagaccatc gcgctgatcg aggaggccgc cggcgccggc 120 

gcgaagctca ttgcgttccc ggaaacctgg attcccggct atccgtggca catctgggtc 180 

ggcacgccgg cgtgggcgat cagccgcggc ttcgtgcagc gctacttcga caattcactg 240

gcctacgaca gcccgcaggc ccagcgcatc gcggacgccg cgaagaagaa caagatcacc 300

gtggtgctcg gcctgtcgga gcgcgagggt ggcagccttt atatctcgca gtggctgatt 360

gggccggacg gcgagaccat tgccaagcgc cgcaaactgc gccccaccca cgtcgagcgc 420

accgtgttcg gcgatggcga cggcagccac atcgcggtgc acgagcgtgc tgacatcggc 480

cgcctcggcg cgctgtgctg ctgggagcac atccagccgc tgaccaaata cgccatgtat 540

gcccagaacg agcaggtgca cgtcgccgcc tggccgagct tctcgatgta cgagccgttc 600

gcccacgcgc tcggctggga agtcaacaat gcggcgagca agatctacgc cgtcgaaggc 660

tcgtgtttcg tgctcggcgc atgcgcggtg atctcgcagg cgatggtcga cgaaatgtgc 720

gacaccgagg acaagcgggc gctggtccat gccggcggcg gccacgcggt gatcttcggg 780

ccggacggca gatcgctggc ggacaagatt ccggagaccc aggaaggcct gctctatgcc 840

gacatcgacc tcggcgcaat tggcgtggcc aagaacgcgg ccgatccggc ggggcactac 900 

tcgcgcccgg acgtgacgcg gctcctgttc aacaacaagc cggcgcgccg ggtcgagtat 960 

ttctcgctgc cggtcgacgc ggtcgagacg ccgccgcagc cctga 1005 

<210> 6 
<211> 334 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 6 

Met Gly Thr Lys His Pro Ala Phe Lys Ala Ala Val Val Gln Ala Ala
1 5 10 15

Pro Glu Trp Leu Asp Leu Asp Arg Thr Val Asp Lys Thr Ile Ala Leu
20 25 30

Ile Glu Glu Ala Ala Gly Ala Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45

Thr Trp Ile Pro Gly Tyr Pro Trp His Ile Trp Val Gly Thr Pro Ala
50 55 60

Trp Ala Ile Ser Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80

Ala Tyr Asp Ser Pro Gln Ala Gln Arg Ile Ala Asp Ala Ala Lys Lys
85 90 95

Asn Lys Ile Thr Val Val Leu Gly Leu Ser Glu Arg Glu Gly Gly Ser
100 105 110

Leu Tyr Ile Ser Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125

Lys Arg Arg Lys Leu Arg Pro Thr His Val Glu Arg Thr Val Phe Gly
130 135 140

Asp Gly Asp Gly Ser His Ile Ala Val His Glu Arg Ala Asp Ile Gly
145 150 155 160

Arg Leu Gly Ala Leu Cys Cys Trp Glu His Ile Gln Pro Leu Thr Lys
165 170 175

Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
180 185 190

Ser Phe Ser Met Tyr Glu Pro Phe Ala His Ala Leu Gly Trp Glu Val
195 200 205

Asn Asn Ala Ala Ser Lys Ile Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220

Leu Gly Ala Cys Ala Val Ile Ser Gln Ala Met Val Asp Glu Met Cys
225 230 235 240

Asp Thr Glu Asp Lys Arg Ala Leu Val His Ala Gly Gly Gly His Ala
245 250 255

Val Ile Phe Gly Pro Asp Gly Arg Ser Leu Ala Asp Lys Ile Pro Glu
260 265 270

Thr Gln Glu Gly Leu Leu Tyr Ala Asp Ile Asp Leu Gly Ala Ile Gly
275 280 285

Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300

Val Thr Arg Leu Leu Phe Asn Asn Lys Pro Ala Arg Arg Val Glu Tyr
305 310 315 320

Phe Ser Leu Pro Val Asp Ala Val Glu Thr Pro Pro Gln Pro
325 330



<210> 7 
<211> 999 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 7 

atgccgagtg attatcatgc tccattcaaa gtagcagttg tccaggcaac tcccgtcttt 60 

ctcgatcgca gcgcgacgat tgagaaggca tgtgagctaa ttgcctgtgc tggacgtgag 120

ggcgcacgtc tgatcgtgtt tcctgaagcg ttcattccca cctatcccga ttgggtctgg 180

accattccac ctggggagat gcggctgctt ggcgaactct acacagagtt gcttgccaat 240

gcggtcacga tccccagtaa tgcaacggat aggctctgcc aggctgcgaa acgagctgct 300

gcgtatgtgg tcatgggaat gaacgaacgc aatatcgagg cgagtggaag gagtctctat 360

aacaccctgt tatacatcga tgctcagggc cagatcatgg gcaaacaccg caagttgata 420

cccacagccg gtgagcggct catatgggcg caaggagatg ggagtacatt ccaggtctac 480

gatactcctc tgggcaaact gggagggctc atctgctggg aaaactacat gcctctggct 540

cgctatgcga tgtatgcctg gggcacgcag atttatgtcg ccccgacatg ggatcgtggc 600

aacctctggc tctctactct gcggcatatc gctaaggagg gaggcgtcta tgttcttggt 660

tgtagtatgg tcatgcgcaa gaatgacatt cccgatcact ttgctttcaa agagcagttt 720

tatgctactg tggacgaatg gatcaacgtt ggtgacagcg ccattgtcca tcccgagggg 780

aactttcttg cgggaccggt gcgccacaaa gaagagattc tctatgcaga acttgatcca 840

cgccaatcgt gcggtccggg atggatgctc gatgtggctg ggcactatgc acgccctgat 900

gtgtttgaat tgattgtcca cacagagatg cgacccatga tgaagcaaga agaggtagga 960

ggagaaaata catctgaggg aggtgtacga ttcttgtaa 999

<210> 8 
<211> 332 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 8 



Met Pro Ser Asp Tyr His Ala Pro Phe Lys Val Ala Val Val Gln Ala
1 5 10 15

Thr Pro Val Phe Leu Asp Arg Ser Ala Thr Ile Glu Lys Ala Cys Glu
20 25 30

Leu Ile Ala Cys Ala Gly Arg Glu Gly Ala Arg Leu Ile Val Phe Pro
35 40 45

Glu Ala Phe Ile Pro Thr Tyr Pro Asp Trp Val Trp Thr Ile Pro Pro
50 55 60

Gly Glu Met Arg Leu Leu Gly Glu Leu Tyr Thr Glu Leu Leu Ala Asn
65 70 75 80

Ala Val Thr Ile Pro Ser Asn Ala Thr Asp Arg Leu Cys Gln Ala Ala
85 90 95

Lys Arg Ala Ala Ala Tyr Val Val Met Gly Met Asn Glu Arg Asn Ile
100 105 110

Glu Ala Ser Gly Arg Ser Leu Tyr Asn Thr Leu Leu Tyr Ile Asp Ala
115 120 125

Gln Gly Gln Ile Met Gly Lys His Arg Lys Leu Ile Pro Thr Ala Gly
130 135 140

Glu Arg Leu Ile Trp Ala Gln Gly Asp Gly Ser Thr Phe Gln Val Tyr
145 150 155 160

Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn Tyr
165 170 175

Met Pro Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Thr Gln Ile Tyr
180 185 190

Val Ala Pro Thr Trp Asp Arg Gly Asn Leu Trp Leu Ser Thr Leu Arg
195 200 205

His Ile Ala Lys Glu Gly Gly Val Tyr Val Leu Gly Cys Ser Met Val
210 215 220

Met Arg Lys Asn Asp Ile Pro Asp His Phe Ala Phe Lys Glu Gln Phe
225 230 235 240

Tyr Ala Thr Val Asp Glu Trp Ile Asn Val Gly Asp Ser Ala Ile Val
245 250 255

His Pro Glu Gly Asn Phe Leu Ala Gly Pro Val Arg His Lys Glu Glu
260 265 270

Ile Leu Tyr Ala Glu Leu Asp Pro Arg Gln Ser Cys Gly Pro Gly Trp
275 280 285

Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu Leu
290 295 300

Ile Val His Thr Glu Met Arg Pro Met Met Lys Gln Glu Glu Val Gly
305 310 315 320

Gly Glu Asn Thr Ser Glu Gly Gly Val Arg Phe Leu
325 330



<210> 9 
<211> 945 
<212> DNA 
<213> Unknown 

<220>

<223> Obtained from an environmental sample 



<400> 9 

atggcggctc acaagatcgc ggtggttcag gcgcccagcg ttctcctcga tcgcgagggc 60 

tcggtcgcgc gcgcggtcac gctgctcgac gaggcggcgg cggccggcgc ccgcctggtc 120 

gtgtttccgg aggcctacat ccccggctac ccggactgga tctggcgcct gcgcccctac 180 

ccggacgtca agctggccgc cgagctgcac gaacggctgc tcgccaacgc ggtggatctc 240

tccaccgacg tgctggcgcc ggtgctggcg gcggcggcgc gtcacgggct caccgtggtc 300

atgtgcgtgc aggagcgcga cgccggattc agccgcgcca cactttacaa caccgcgctg 360

gtcatcgacg ccgccggcaa gatcgcgaac cggcaccgca agctcatgcc caccaacccc 420

gagcgaatgg tgtggggatt cggtgacgcc tcggggctgc gggtggtgag cacgcccgtc 480

gggcgggtgg gcacgctcct gtgctgggag agctacatgc ccctggcgcg ctgcgcgctc 540

tacgccgagg gggtcgagat ctacgtgacc ccgacctggg actacggcga aggctggcgc 600

gccagcatgc agcacatcgc ccgcgagggg cgctgctggg tggtgaccgc ttgcatgtgc 660

gtgcaggcgc gcgacgtgcc ggccgacttc cccgggcgcg cccagctcta ccccgacgag 720

gaggagtggt tgaaccccgg cgattcgctg gtggtcgacc ccggcggcaa gatcgtggcc 780

ggtccgatgt cgcgcgagaa ggggatcttg tacgcggaga tcgatccgga tcgcgtggcg 840

ggggcgcacc gctcgttcga cgtcgtgggc cactactcgc gtcccgacgt gttccggctg 900

gaggtcgatc ggacaccggc ggcgccggtg agcttcaaaa aatga 945



<210> 10 
<211> 314 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 10 



Met Ala Ala His Lys Ile Ala Val Val Gln Ala Pro Ser Val Leu Leu
1 5 10 15

Asp Arg Glu Gly Ser Val Ala Arg Ala Val Thr Leu Leu Asp Glu Ala
20 25 30

Ala Ala Ala Gly Ala Arg Leu Val Val Phe Pro Glu Ala Tyr Ile Pro
35 40 45

Gly Tyr Pro Asp Trp Ile Trp Arg Leu Arg Pro Tyr Pro Asp Val Lys
50 55 60

Leu Ala Ala Glu Leu His Glu Arg Leu Leu Ala Asn Ala Val Asp Leu
65 70 75 80

Ser Thr Asp Val Leu Ala Pro Val Leu Ala Ala Ala Ala Arg His Gly
85 90 95

Leu Thr Val Val Met Cys Val Gln Glu Arg Asp Ala Gly Phe Ser Arg
100 105 110

Ala Thr Leu Tyr Asn Thr Ala Leu Val Ile Asp Ala Ala Gly Lys Ile
115 120 125

Ala Asn Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val
130 135 140

Trp Gly Phe Gly Asp Ala Ser Gly Leu Arg Val Val Ser Thr Pro Val
145 150 155 160

Gly Arg Val Gly Thr Leu Leu Cys Trp Glu Ser Tyr Met Pro Leu Ala
165 170 175

Arg Cys Ala Leu Tyr Ala Glu Gly Val Glu Ile Tyr Val Thr Pro Thr
180 185 190

Trp Asp Tyr Gly Glu Gly Trp Arg Ala Ser Met Gln His Ile Ala Arg
195 200 205

Glu Gly Arg Cys Trp Val Val Thr Ala Cys Met Cys Val Gln Ala Arg
210 215 220

Asp Val Pro Ala Asp Phe Pro Gly Arg Ala Gln Leu Tyr Pro Asp Glu
225 230 235 240

Glu Glu Trp Leu Asn Pro Gly Asp Ser Leu Val Val Asp Pro Gly Gly
245 250 255

Lys Ile Val Ala Gly Pro Met Ser Arg Glu Lys Gly Ile Leu Tyr Ala
260 265 270

Glu Ile Asp Pro Asp Arg Val Ala Gly Ala His Arg Ser Phe Asp Val
275 280 285

Val Gly His Tyr Ser Arg Pro Asp Val Phe Arg Leu Glu Val Asp Arg
290 295 300

Thr Pro Ala Ala Pro Val Ser Phe Lys Lys
305 310

<210> 11 
<211> 966 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 11

atgactggat cttatcctaa agacacactg atcgttgggc tagctcaaat cgctcctgtc 60

tggctggatc gggcggggac actgtcaaag atactggctc aagtccatgc ggcaaatcaa 120

gcgggttgtc atctcgtagc atttggcgaa gggctgcttc ctggatatcc gttttggatt 180

gagcgaacaa atggcgcgct gttcaactcg actgtacaaa aggaaatcca cgcgcattat 240

atggatcagg cggtgcagat cgaagccggt catctcgatc cgctttgtgc aacagccaag 300

aaatttggaa tcaccgttgt actcggatgc atcgaacgcc cactcgatcg gggcggtcac 360

agcttgtatg caagtctggt atatattgat tccgagggca gcattcaatc cgtgcatcgc 420

aaactaatgc caacctacga agaacgactt acctggtcgt caggcgatgg gcacggttta 480

cgagtgcata ccttaggtgc gtttacggtg ggtggtctca actgttggga aaattggatg 540

cccttggcgc gcgcagcgat gtatggtcag ggtgaagatt tacatgttgc ^tctggcca 600

ggcggttctc atctcacgca ggatattacc cgctttattg cgctcgaatc acgttcgtac 660

gtattatctg tctccggtct gatgcgcgca accgattttc caaaagatac tccccatctt 720

gcctccatcc tagctaaagg tgaagagatt cttgcgaatg gtggttcttg tattgcaggt 780

cctgacggca agtgggtcgt tgggcctctt gtaggagaag agaagttaat tgtcgcaacc 840

attgatcact gccgcgtgcg cgaagaacgt cagaatttcg atccttccgg gcattacagc 900

cggcccgatg tactgcaatt aaaaatcaac agggaacgcc agagcacaat ttcatttagc 960

gagtag 966

<210> 12
<211> 321
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample



<400> 12 

Met Thr Gly Ser Tyr Pro Lys Asp Thr Leu Ile Val Gly Leu Ala Gln
1 5 10 15

Ile Ala Pro Val Trp Leu Asp Arg Ala Gly Thr Leu Ser Lys Ile Leu
20 25 30

Ala Gln Val His Ala Ala Asn Gln Ala Gly Cys His Leu Val Ala Phe
35 40 45

Gly Glu Gly Leu Leu Pro Gly Tyr Pro Phe Trp Ile Glu Arg Thr Asn
50 55 60

Gly Ala Leu Phe Asn Ser Thr Val Gln Lys Glu Ile His Ala His Tyr
65 70 75 80

Met Asp Gln Ala Val Gln Ile Glu Ala Gly His Leu Asp Pro Leu Cys
85 90 95

Ala Thr Ala Lys Lys Phe Gly Ile Thr Val Val Leu Gly Cys Ile Glu
100 105 110

Arg Pro Leu Asp Arg Gly Gly His Ser Leu Tyr Ala Ser Leu Val Tyr
115 120 125

Ile Asp Ser Glu Gly Ser Ile Gln Ser Val His Arg Lys Leu Met Pro
130 135 140

Thr Tyr Glu Glu Arg Leu Thr Trp Ser Ser Gly Asp Gly His Gly Leu
145 150 155 160

Arg Val His Thr Leu Gly Ala Phe Thr Val Gly Gly Leu Asn Cys Trp
165 170 175

Glu Asn Trp Met Pro Leu Ala Arg Ala Ala Met Tyr Gly Gln Gly Glu
180 185 190

Asp Leu His Val Ala Ile Trp Pro Gly Gly Ser His Leu Thr Gln Asp
195 200 205

Ile Thr Arg Phe Ile Ala Leu Glu Ser Arg Ser Tyr Val Leu Ser Val
210 215 220

Ser Gly Leu Met Arg Ala Thr Asp Phe Pro Lys Asp Thr Pro His Leu
225 230 235 240

Ala Ser Ile Leu Ala Lys Gly Glu Glu Ile Leu Ala Asn Gly Gly Ser
245 250 255

Cys Ile Ala Gly Pro Asp Gly Lys Trp Val Val Gly Pro Leu Val Gly
260 265 270

Glu Glu Lys Leu Ile Val Ala Thr Ile Asp His Cys Arg Val Arg Glu
275 280 285

Glu Arg Gln Asn Phe Asp Pro Ser Gly His Tyr Ser Arg Pro Asp Val
290 295 300

Leu Gln Leu Lys Ile Asn Arg Glu Arg Gln Ser Thr Ile Ser Phe Ser
305 310 315 320

Glu



<210> 13 
<211> 1014 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 13 

atgggcattc aacatccgaa atataaggtt gcggtggtgc aggcggcgcc ggcctggctc 60
gatctcgatg cgtcgatcgc caaatcgatc gcgttgatcg aggaggcggc tgccaatggc 120
gccaagctga tcgccttccc ggaggcgttc atccctggct atccctggta tatctggctg 180
gactcgccgg cctgggcgat cggccgcggt tttgtgcagc gctatttcga caactcgctg 240
gcctatgaca gcccgcaggc cgagaagctg cggctggcgg tgaagaaggc cggcctcacc 300
gccgtgatcg gcctctccga gcgcgagggc ggcagccttt atctcgcgca atggctgatc 360
gggcccgatg gcgagaccat cgcaaagcgc cgcaagctgc ggccgaccca tgccgagcgc 420
accgtctatg gcgaaggcga tggcagcgat ctcgcggtgc atgaccgccc cggcatcggc 480
cggctcggcg cgctgtgctg ctgggagcat ctgcagccgc tgtcgaaata cgcgatgtat 540
gcccagaacg agcaggttca tgtcgcggcc tggccgagct tctcgctcta cgacccgttc 600
gcgccggcgc tcggctggga ggtcaacaat gcggcctcac gcgtctatgc ggtggaaggc 660
tcgtgcttcg tgctggcgcc ctgcgcgacg gtgtcgaagg cgatgatcga cgagctctgc 720
gaccgcgacg acaagcacgg gctgctgcat gtcggcgggg gacacgccgc gatctatggg 780
ccggacggct cttcgattgc ggagaaattg ccgccggagc aggagggcct gctctatgcc 840
gacatcgatc tcggcgccat cgggattgcc aagaacgccg ccgatccggc cggacattac 900
tcgcggcccg acgtgacgcg gctgttgctc aacaagaagc cgtcgaagcg tgtcgagcat 960
ttttcgctgc cggtcgacaa tgtcgagccg gagatcgacg ccgccgccag ctga 1014



<210> 14
<211> 337
<212> PRT
<213> Unknown

<220>
<223> Obtained from an environmental sample

<400> 14

Met Gly Ile Gln His Pro Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15

Pro Ala Trp Leu Asp Leu Asp Ala Ser Ile Ala Lys Ser Ile Ala Leu
20 25 30

Ile Glu Glu Ala Ala Ala Asn Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45

Ala Phe Ile Pro Gly Tyr Pro Trp Tyr Ile Trp Leu Asp Ser Pro Ala
50 55 60

Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80

Ala Tyr Asp Ser Pro Gln Ala Glu Lys Leu Arg Leu Ala Val Lys Lys
85 90 95

Ala Gly Leu Thr Ala Val Ile Gly Leu Ser Glu Arg Glu Gly Gly Ser
100 105 110

Leu Tyr Leu Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125

Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly
130 135 140

Glu Gly Asp Gly Ser Asp Leu Ala Val His Asp Arg Pro Gly Ile Gly
145 150 155 160

Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175

Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
180 185 190

Ser Phe Ser Leu Tyr Asp Pro Phe Ala Pro Ala Leu Gly Trp Glu Val
195 200 205

Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220

Leu Ala Pro Cys Ala Thr Val Ser Lys Ala Met Ile Asp Glu Leu Cys
225 230 235 240

Asp Arg Asp Asp Lys His Gly Leu Leu His Val Gly Gly Gly His Ala
245 250 255

Ala Ile Tyr Gly Pro Asp Gly Ser Ser Ile Ala Glu Lys Leu Pro Pro
260 265 270

Glu Gln Glu Gly Leu Leu Tyr Ala Asp Ile Asp Leu Gly Ala Ile Gly
275 280 285

Ile Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300

Val Thr Arg Leu Leu Leu Asn Lys Lys Pro Ser Lys Arg Val Glu His
305 310 315 320

Phe Ser Leu Pro Val Asp Asn Val Glu Pro Glu Ile Asp Ala Ala Ala
325 330 335

Ser



<210> 15 
<211> 1047 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 






<400> 15 

atgccaacat caaaacaatt tagagtcgct gcagttcaag ccgccccggt atttcttgac 60
ctggagggcg caataagcaa aggcatctcc ctcattgagg aggccgcttc caatggagcc 120
aagctcattg ccttcccgga aacgtggatt cccggctacc cctggtggat ctggctggac 180
tcacccgctt ggggcatgcg ctttgtccag cgctattttg acaactcgct catgctgggt 240
agtgagcaag ccaagcgcat gaaccaggct gccgccaata acaagattta cgtggtgatg 300
ggttatagcg aacgcagtgg cggcagcctc tacatgggcc aatccattat caacgacaag 360
ggtgaaacga tttttacccg ccgcaaactc aagccaactc atgtcgagcg taccgtgttt 420
ggggagggag acggcagcca tctttgcgta atggataccg agattggccg cgtcggcgcg 480
atgtgctgtt gggaacattt gcagccgctc agcaaatatg caatgtattc tcaggatgaa 540
caaattcaca ttgcctcctg gccgagcttt tcgttatatc ggggggcagc ctatgcactc 600
ggccccgaac tgaacaacgc cgccagccaa atgtatgcag ccgaaggcca gtgctttgtc 660
cttgcccctt gcgccaccgt ctcaaaggag atgatcgaaa tgctgataga tgatcccagg 720
aaagagccgc ttctgctgga aggtggcggg ttcaccatga tttacggccc cgatgggcga 780
cctctggcta aaccgttgcc tgaaaacgag gaagggctgc tatatgccga tattgacctg 840
ggcatgattt caatggccaa ggctgccgcc gacccggcag gtcactacgc acgcccggat 900
gtcactcgcc tactattcaa ttccgcgccc gccaatcgcg tcgagtatat caacccagcg 960
tcaggcccaa ccgaatcctt aaaagatatg ggaaagatgc aaatggaggc cgaacagcaa 1020
aaggcggccc tgcgagagat gatctaa 1047



<210> 16 
<211> 348 
<212> PRT 
<213> Unknown 



<220>
<223> Obtained from an environmental sample

<400> 16

Met Pro Thr Ser Lys Gln Phe Arg Val Ala Ala Val Gln Ala Ala Pro
1 5 10 15

Val Phe Leu Asp Leu Glu Gly Ala Ile Ser Lys Gly Ile Ser Leu Ile
20 25 30

Glu Glu Ala Ala Ser Asn Gly Ala Lys Leu Ile Ala Phe Pro Glu Thr
35 40 45

Trp Ile Pro Gly Tyr Pro Trp Trp Ile Trp Leu Asp Ser Pro Ala Trp
50 55 60

Gly Met Arg Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu Met Leu Gly
65 70 75 80

Ser Glu Gln Ala Lys Arg Met Asn Gln Ala Ala Ala Asn Asn Lys Ile
85 90 95

Tyr Val Val Met Gly Tyr Ser Glu Arg Ser Gly Gly Ser Leu Tyr Met
100 105 110

Gly Gln Ser Ile Ile Asn Asp Lys Gly Glu Thr Ile Phe Thr Arg Arg
115 120 125

Lys Leu Lys Pro Thr His Val Glu Arg Thr Val Phe Gly Glu Gly Asp
130 135 140

Gly Ser His Leu Cys Val Met Asp Thr Glu Ile Gly Arg Val Gly Ala
145 150 155 160

Met Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr Ala Met Tyr
165 170 175

Ser Gln Asp Glu Gln Ile His Ile Ala Ser Trp Pro Ser Phe Ser Leu
180 185 190

Tyr Arg Gly Ala Ala Tyr Ala Leu Gly Pro Glu Leu Asn Asn Ala Ala
195 200 205

Ser Gln Met Tyr Ala Ala Glu Gly Gln Cys Phe Val Leu Ala Pro Cys
210 215 220

Ala Thr Val Ser Lys Glu Met Ile Glu Met Leu Ile Asp Asp Pro Arg
225 230 235 240

Lys Glu Pro Leu Leu Leu Glu Gly Gly Gly Phe Thr Met Ile Tyr Gly
245 250 255

Pro Asp Gly Arg Pro Leu Ala Lys Pro Leu Pro Glu Asn Glu Glu Gly
260 265 270

Leu Leu Tyr Ala Asp Ile Asp Leu Gly Met Ile Ser Met Ala Lys Ala
275 280 285

Ala Ala Asp Pro Ala Gly His Tyr Ala Arg Pro Asp Val Thr Arg Leu
290 295 300

Leu Phe Asn Ser Ala Pro Ala Asn Arg Val Glu Tyr Ile Asn Pro Ala
305 310 315 320

Ser Gly Pro Thr Glu Ser Leu Lys Asp Met Gly Lys Met Gln Met Glu
325 330 335

Ala Glu Gln Gln Lys Ala Ala Leu Arg Glu Met Ile
340 345











<210> 17 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 17 

atgagagttg ttaaagccgc agctgtccaa ctgagtcccg tcctctatag ccgcgaggga 60
acggtcgaga aggtcgtgcg gaagatccat gaacttgccg aagagggagt cgagttcgtc 120
acctttcctg agaccgtggt gccttattac ccgtactttt cgttcgttca gacgcccttg 180
cagcaaatct tcggaacaga gtatctgagg ctgctcgacc aggcagtcac cgtgccatcc 240
gccgccaccg acgcgatcgg cgaggctgcc aggttcgctg gagttgttgt ctcgatcggc 300
gtcaacgagc gagacggggg aactctgtac aacactcagc ttctcttcga tgccgacgga 360
agcttaattc agcggcgccg caagatcacg cccacccatt acgagcgcat gatctggggc 420
cagggtgacg gctcaggtct gcgggccgtt gatagcaagg ccggccgcat tggtcagctg 480
gcatgctggg agcacaacaa tccactggcg cgctacgcgc tgatagccga cggcgagcag 540
atccattcgg ccatgtatcc gggctccatg ttcggcgact cgtttgccaa aaagaccgaa 600
atcaatatcc ggcagcatgc gctggagtct gcgtgcttcg tcgtgaacgc aacggcctgg 660
ctggacggcg atcaacaggc gcaaatcatg aaggacaccg gctgcagcat cggcccgatc 720
tccggcggtt gcttcaccac tatcgtggcg ccggacggtt ccctgatcgg cgagcccctc 780
cgctcgggtg agggcgtggt catcgccgac ctcgacttca cgttaatcga caggcgtaag 840
caggtgatgg actcgcgagg ccactacagc cggccggagt tgctcagcct cttaatagac 900
cgcaccccta ccgcgcactt tcacgaacgc gcttcgcacc ccacgacagg agctgagcaa 960
ggctccgagg atgtgttcga ggctaacatt taa 993



<210> 18 
<211> 330 
<212> PRT 
<213> Unknown 



<220>
<223> Obtained from an environmental sample

<400> 18

Met Arg Val Val Lys Ala Ala Ala Val Gln Leu Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Glu Gly Thr Val Glu Lys Val Val Arg Lys Ile His Glu Leu
20 25 30

Ala Glu Glu Gly Val Glu Phe Val Thr Phe Pro Glu Thr Val Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Thr Pro Leu Gln Gln Ile Phe
50 55 60

Gly Thr Glu Tyr Leu Arg Leu Leu Asp Gln Ala Val Thr Val Pro Ser
65 70 75 80

Ala Ala Thr Asp Ala Ile Gly Glu Ala Ala Arg Phe Ala Gly Val Val
85 90 95

Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr
100 105 110

Gln Leu Leu Phe Asp Ala Asp Gly Ser Leu Ile Gln Arg Arg Arg Lys
115 120 125

Ile Thr Pro Thr His Tyr Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140

Ser Gly Leu Arg Ala Val Asp Ser Lys Ala Gly Arg Ile Gly Gln Leu
145 150 155 160

Ala Cys Trp Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Leu Ile Ala
165 170 175

Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Met Phe Gly
180 185 190

Asp Ser Phe Ala Lys Lys Thr Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205

Glu Ser Ala Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Gly Asp
210 215 220

Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Ser Ile Gly Pro Ile
225 230 235 240

Ser Gly Gly Cys Phe Thr Thr Ile Val Ala Pro Asp Gly Ser Leu Ile
245 250 255

Gly Glu Pro Leu Arg Ser Gly Glu Gly Val Val Ile Ala Asp Leu Asp
260 265 270

Phe Thr Leu Ile Asp Arg Arg Lys Gln Val Met Asp Ser Arg Gly His
275 280 285

Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Thr
290 295 300

Ala His Phe His Glu Arg Ala Ser His Pro Thr Thr Gly Ala Glu Gln
305 310 315 320

Gly Ser Glu Asp Val Phe Glu Ala Asn Ile
325 330



<210> 19 
<211> 1050 
<212> DNA 
<213> Unknown 

<220>
<223> Obtained from an environmental sample

<400> 19
atggaaagca acttccttgc cgcagcagtg caagcagaac cggtttactt caatgctttt 60
cagacggccg aaaaggccgc gtcattgatt gacgatgccg gtcggcaggg ggctcgctta 120
gtgacatttc ccgaaacgtg gctgcccggt tacccgtact ggatctggct tggtgccccc 180
gcctggggaa tgcatcattt catcctaaag taccatcaaa actcgccggt tgcaggagga 240
ccagaggaac agatcctttg tcaggcggcc cgccgcaacg ggatttttgt cgtcatggga 300
ctcagcgaga aaatcggggc aagcctctac atggcgcagt ggttcatcag tccagacggc 360
aaagtggtcg ctcgccgacg caaattgaag cctactcacg tcgaacgttc ggtcttcggg 420
gaaggggatg gttccgacat tgtcgttctt gatacacccc ttggaaaggt cgggggcctt 480
tgctgctggg agcacatgca gccactttcg aagtacgcca tgtactcgca aggcgagcag 540
atccatgctg cttcttggcc gagtgttagc gtctatcgcg ataaaattta cgttctgggg 600
ccggagctga acggtgccgc caatcagatg tatgcggcag aaggtcagtg tttcgtcctg 660
gcatcctggg caacggtttc acaagcggct atcgatcttt tttgcgacac gcccgacaag 720
gccgcgctca tgaaaattgg tggtggtttt tcccagatct atgggccaga cgggtgcccc 780
ctggcgaagc cgttgccgga ggacgtcgaa ggattggtga ccgctgagat tgacttcaat 840
gccatcacgc gcgtgaaagc agcggcggac cccgtagggc actatagccg gcccgatgta 900
ttccgcctgt tgttcaatcg tacgcgccaa gaacgcgtgg tttctgtcaa cacgtttgtg 960
ccaggtgtca cccagcgaac cgccaagaat gggtcggcgg acgaattggt cggtcacccg 1020
gagaacgctg tcgcccgggc tgcagagtaa 1050

<210> 20
<211> 349
<212> PRT
<213> Unknown

<220>
<223> Obtained from an environmental sample
<400> 20 

Met Glu Ser Asn Phe Leu Ala Ala Ala Val Gln Ala Glu Pro Val Tyr
1 5 10 15

Phe Asn Ala Phe Gln Thr Ala Glu Lys Ala Ala Ser Leu Ile Asp Asp
20 25 30

Ala Gly Arg Gln Gly Ala Arg Leu Val Thr Phe Pro Glu Thr Trp Leu
35 40 45

Pro Gly Tyr Pro Tyr Trp Ile Trp Leu Gly Ala Pro Ala Trp Gly Met
50 55 60

His His Phe Ile Leu Lys Tyr His Gln Asn Ser Pro Val Ala Gly Gly
65 70 75 80

Pro Glu Glu Gln Ile Leu Cys Gln Ala Ala Arg Arg Asn Gly Ile Phe
85 90 95

Val Val Met Gly Leu Ser Glu Lys Ile Gly Ala Ser Leu Tyr Met Ala
100 105 110

Gln Trp Phe Ile Ser Pro Asp Gly Lys Val Val Ala Arg Arg Arg Lys
115 120 125

Leu Lys Pro Thr His Val Glu Arg Ser Val Phe Gly Glu Gly Asp Gly
130 135 140

Ser Asp Ile Val Val Leu Asp Thr Pro Leu Gly Lys Val Gly Gly Leu
145 150 155 160

Cys Cys Trp Glu His Met Gln Pro Leu Ser Lys Tyr Ala Met Tyr Ser
165 170 175

Gln Gly Glu Gln Ile His Ala Ala Ser Trp Pro Ser Val Ser Val Tyr
180 185 190

Arg Asp Lys Ile Tyr Val Leu Gly Pro Glu Leu Asn Gly Ala Ala Asn
195 200 205

Gln Met Tyr Ala Ala Glu Gly Gln Cys Phe Val Leu Ala Ser Trp Ala
210 215 220

Thr Val Ser Gln Ala Ala Ile Asp Leu Phe Cys Asp Thr Pro Asp Lys
225 230 235 240

Ala Ala Leu Met Lys Ile Gly Gly Gly Phe Ser Gln Ile Tyr Gly Pro
245 250 255

Asp Gly Cys Pro Leu Ala Lys Pro Leu Pro Glu Asp Val Glu Gly Leu
260 265 270

Val Thr Ala Glu Ile Asp Phe Asn Ala Ile Thr Arg Val Lys Ala Ala
275 280 285

Ala Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val Phe Arg Leu Leu
290 295 300

Phe Asn Arg Thr Arg Gln Glu Arg Val Val Ser Val Asn Thr Phe Val
305 310 315 320

Pro Gly Val Thr Gln Arg Thr Ala Lys Asn Gly Ser Ala Asp Glu Leu
325 330 335

Val Gly His Pro Glu Asn Ala Val Ala Arg Ala Ala Glu
340 345



<210> 21 
<211> 1065 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
<400> 21
atggcactag aacatccgaa gtacgtggcg gccgtggttc aggccgcgcc cgaattcctg 60
aatctagaca gagggatcga aaagacgatc gcattgatcg acgaagcggg acagaaaggg 120
gcggccctga ttgcatttcc ggaaacctgg ctgccgggct atccgtttca tgtctggctc 180
ggtcctcccg catgggcgct tggctcagga ttcgtccagc gctatttcga caactcgatg 240
acgtacgata gtcctcaggc cgctgcactg agggacgctg ccgcgcgcaa cgggatcacg 300
gtggtattgg gcttgtcgga gcgatgcggc ggcagcctct atatcgcgca atggatcatc 360
ggcccggatg gcgcgacggt cgccacgcgc cgcaaattgc ggccgactca tatcgagcgc 420
accgttttcg gcgatggcga cggcagcgat ctggcagtac acgatctcaa catcggccgc 480
cttggcgcac tgtgctgctg ggagcacatt cagccgctga ccaagtacgc gatgtatgcg 540
cagcacgaac aggtgcacgt cgcggcctgg ccgagcttct ccatgtatga attcgcgccc 600
gcgctcggtc acgaggtgaa caacgcagtc agccgcgtct atgccgttga gggatcgtgc 660
ttcgtgctcg cgccgtgcgc ggtcatcagc gagcaaatgg tcgacatgtt gtgcgacacg 720
gcagacaagc gcgcgatgat acgtgccggc ggcgggcacg cagtggcgtt cgggccggac 780
ggcgaagctc tggtcgagaa actgccggaa aatgaggaag gcttgctgct ggtcgatatc 840
gatctcggtc gcatctcgct tgcgaaggct gcggccgacc ccgtcggtca ctacgcgcgc 900
cccgatgtct tgcggctctg gttcgacaag caaccgcggc ggtgcgtcga acatgccggc 960
gagaacgacg cgtcgcgcag gtcgcacggg tcgtccgggt cacaatcgcc ggcgcaggat 1020
gggccggcga acgacatggt agaccgtcag gaaaacgtcg attga 1065



<210> 22 
<211> 354 
<212> PRT 
<213> Unknown 



<220>
<223> Obtained from an environmental sample

<400> 22

Met Ala Leu Glu His Pro Lys Tyr Val Ala Ala Val Val Gln Ala Ala
1 5 10 15

Pro Glu Phe Leu Asn Leu Asp Arg Gly Ile Glu Lys Thr Ile Ala Leu
20 25 30

Ile Asp Glu Ala Gly Gln Lys Gly Ala Ala Leu Ile Ala Phe Pro Glu
35 40 45

Thr Trp Leu Pro Gly Tyr Pro Phe His Val Trp Leu Gly Pro Pro Ala
50 55 60

Trp Ala Leu Gly Ser Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Met
65 70 75 80

Thr Tyr Asp Ser Pro Gln Ala Ala Ala Leu Arg Asp Ala Ala Ala Arg
85 90 95

Asn Gly Ile Thr Val Val Leu Gly Leu Ser Glu Arg Cys Gly Gly Ser
100 105 110

Leu Tyr Ile Ala Gln Trp Ile Ile Gly Pro Asp Gly Ala Thr Val Ala
115 120 125

Thr Arg Arg Lys Leu Arg Pro Thr His Ile Glu Arg Thr Val Phe Gly
130 135 140

Asp Gly Asp Gly Ser Asp Leu Ala Val His Asp Leu Asn Ile Gly Arg
145 150 155 160

Leu Gly Ala Leu Cys Cys Trp Glu His Ile Gln Pro Leu Thr Lys Tyr
165 170 175

Ala Met Tyr Ala Gln His Glu Gln Val His Val Ala Ala Trp Pro Ser
180 185 190

Phe Ser Met Tyr Glu Phe Ala Pro Ala Leu Gly His Glu Val Asn Asn
195 200 205

Ala Val Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val Leu Ala
210 215 220

Pro Cys Ala Val Ile Ser Glu Gln Met Val Asp Met Leu Cys Asp Thr
225 230 235 240

Ala Asp Lys Arg Ala Met Ile Arg Ala Gly Gly Gly His Ala Val Ala
245 250 255

Phe Gly Pro Asp Gly Glu Ala Leu Val Glu Lys Leu Pro Glu Asn Glu
260 265 270

Glu Gly Leu Leu Leu Val Asp Ile Asp Leu Gly Arg Ile Ser Leu Ala
275 280 285

Lys Ala Ala Ala Asp Pro Val Gly His Tyr Ala Arg Pro Asp Val Leu
290 295 300

Arg Leu Trp Phe Asp Lys Gln Pro Arg Arg Cys Val Glu His Ala Gly
305 310 315 320

Glu Asn Asp Ala Ser Arg Arg Ser His Gly Ser Ser Gly Ser Gln Ser
325 330 335

Pro Ala Gln Asp Gly Pro Ala Asn Asp Met Val Asp Arg Gln Glu Asn
340 345 350

Val Asp



<210> 23 
<211> 1005 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 23 

atgaccgtta tcaaagcagc cgccattcaa atcagccccg tgctctacag ccgggcgggg 60
acagtcgaga aagttgttag gaaggttaga gagctcgggg ccaaaggtgt ccgattcgct 120
acctttcccg aaaccatcat accgtactac ccgtacttct cgttcgttca gtcggcgttc 180
gacatgaagc ttgggagtga acatcagcgg ctgctcgacg aatcagtcac aattccttcg 240
tccgagacgg acgcgatcgc ccaggccgcc aaggaagcgg gcatggtggt gtccgtcggg 300
gtcaatgagc gcgatgggcg atccatctac aacactcaac ttctgttcga cgctgatggc 360
acgctcattc agcgtaggcg aaagatcacc ccgacctatc acgagcgcat gatttggggt 420
caaggcgatg gatccggcct acgcgcggtc gatagcgccg tgggccggat cggccagctt 480
gcctgctggg agcactacct tcccctggcg cggtacgccc tcatcgcgga cggagagcaa 540
atccactcgg caatgtatcc aggctcgttc gctggtccgc tatttgccga gcagatagag 600
gttagtatcc gccagcacgc gcttgagtca gcctgcttcg tcgtcaacgc gaccggatgg 660
cttagcgccg agcagcaagc tcaaatagtg aaggataccg gatgcgtcgt tggaccaatc 720
tccggtggct gctttacggc gattgttgat ccggagggtc ggatcatggg ggcgccactc 780
aaggcaggtg agggggaggt catcgcagat ctcgattttg cgcagattga tttccgcaag 840
cgtgtgatgg atacgcgagg gcactacagc cgccccgaac ttctaagcct cacgatcgac 900
cgcagtcagc accatcacat gactgagcga ggcgccgatc accgtgtaga ccacgcaaag 960
ccaacggtca ccgcagagca gtcggccgtc gagccggcgg aatga 1005



<210> 24 
<211> 334 
<212> PRT 
<213> Unknown 



<220>
<223> Obtained from an environmental sample

<400> 24

Met Thr Val Ile Lys Ala Ala Ala Ile Gln Ile Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Ala Gly Thr Val Glu Lys Val Val Arg Lys Val Arg Glu Leu
20 25 30

Gly Ala Lys Gly Val Arg Phe Ala Thr Phe Pro Glu Thr Ile Ile Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Ser Ala Phe Asp Met Lys Leu
50 55 60

Gly Ser Glu His Gln Arg Leu Leu Asp Glu Ser Val Thr Ile Pro Ser
65 70 75 80

Ser Glu Thr Asp Ala Ile Ala Gln Ala Ala Lys Glu Ala Gly Met Val
85 90 95

Val Ser Val Gly Val Asn Glu Arg Asp Gly Arg Ser Ile Tyr Asn Thr
100 105 110

Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125

Ile Thr Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140

Ser Gly Leu Arg Ala Val Asp Ser Ala Val Gly Arg Ile Gly Gln Leu
145 150 155 160

Ala Cys Trp Glu His Tyr Leu Pro Leu Ala Arg Tyr Ala Leu Ile Ala
165 170 175

Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Phe Ala Gly
180 185 190

Pro Leu Phe Ala Glu Gln Ile Glu Val Ser Ile Arg Gln His Ala Leu
195 200 205

Glu Ser Ala Cys Phe Val Val Asn Ala Thr Gly Trp Leu Ser Ala Glu
210 215 220

Gln Gln Ala Gln Ile Val Lys Asp Thr Gly Cys Val Val Gly Pro Ile
225 230 235 240

Ser Gly Gly Cys Phe Thr Ala Ile Val Asp Pro Glu Gly Arg Ile Met
245 250 255

Gly Ala Pro Leu Lys Ala Gly Glu Gly Glu Val Ile Ala Asp Leu Asp
260 265 270

Phe Ala Gln Ile Asp Phe Arg Lys Arg Val Met Asp Thr Arg Gly His
275 280 285

Tyr Ser Arg Pro Glu Leu Leu Ser Leu Thr Ile Asp Arg Ser Gln His
290 295 300

His His Met Thr Glu Arg Gly Ala Asp His Arg Val Asp His Ala Lys
305 310 315 320

Pro Thr Val Thr Ala Glu Gln Ser Ala Val Glu Pro Ala Glu
325 330







<210> 25 
<211> 939 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 25 

gtgtcatcaa ctatcaaagt cgccattatt caggccgctc ctgcttacta cgacctgcag 60
gcgtcgctgg caaaggccgc cagtctgatc cgcgaggcgg cacgcggcgg cgcgcaattc 120
gtcgcgttcg gggagacatg gctgccgggc tatccgatgt ggctggattg gtgtcctggc 180
gcgatcatct gggataaccc cgccaccaaa accgtcttcg cgcgcctcca tgaaaacagc 240
gtcgccgttc ccggcaggga aacggcattt ctcgccgacc ttgcgatgtc gttaagcatc 300
gtattatgca tcggcgtcaa tgagaaggtc atgaatgggc cgggacacgg cacgctctac 360
aacacgctcc tgacgtttga tgcaacgggt gaaatcatca atcatcatcg caagttgatg 420
ccaacctatg gcgagagatt ggtatggggg ccgggcgacg cagttggcgt gcaagcggtt 480
gatagtacgg tcgggcgcat cggcgggctg atctgttggg agcactggat gccgctgcca 540
cgccaactca tgcacaacag cggcgagcag attcacgtct gcgcatggcc gggcgtgcac 600
gaaatgcacc agatcgcgag ccgtcattat gcattcgagg gccgctgctt tgtgctggcc 660
gccggattga tcatgcccgc gttcgacctg cccagcgaac tcgaatttcc gcccgaactg 720
gccgacaagc gcgactatct cctaatgaac ggcggcagcg ccatcatcaa gcccaatggc 780
aaatatctcg ccgggccggt ttatgacgaa gagactattc tctgcgccga ccttgacctg 840
actgagaaca tcaaggagca gatgacgctg gacgtgacag ggcattatgc gcgagcggaa 900
ctgtttgact tgaatgtggt gcggcggcgg aatgcgtag 939



<210> 26
<211> 312
<212> PRT
<213> Unknown

<220>
<223> Obtained from an environmental sample

<400> 26

Val Ser Ser Thr Ile Lys Val Ala Ile Ile Gln Ala Ala Pro Ala Tyr
1 5 10 15

Tyr Asp Leu Gln Ala Ser Leu Ala Lys Ala Ala Ser Leu Ile Arg Glu
20 25 30

Ala Ala Arg Gly Gly Ala Gln Phe Val Ala Phe Gly Glu Thr Trp Leu
35 40 45

Pro Gly Tyr Pro Met Trp Leu Asp Trp Cys Pro Gly Ala Ile Ile Trp
50 55 60

Asp Asn Pro Ala Thr Lys Thr Val Phe Ala Arg Leu His Glu Asn Ser
65 70 75 80

Val Ala Val Pro Gly Arg Glu Thr Ala Phe Leu Ala Asp Leu Ala Met
85 90 95

Ser Leu Ser Ile Val Leu Cys Ile Gly Val Asn Glu Lys Val Met Asn
100 105 110

Gly Pro Gly His Gly Thr Leu Tyr Asn Thr Leu Leu Thr Phe Asp Ala
115 120 125

Thr Gly Glu Ile Ile Asn His His Arg Lys Leu Met Pro Thr Tyr Gly
130 135 140

Glu Arg Leu Val Trp Gly Pro Gly Asp Ala Val Gly Val Gln Ala Val
145 150 155 160

Asp Ser Thr Val Gly Arg Ile Gly Gly Leu Ile Cys Trp Glu His Trp
165 170 175

Met Pro Leu Pro Arg Gln Leu Met His Asn Ser Gly Glu Gln Ile His
180 185 190

Val Cys Ala Trp Pro Gly Val His Glu Met His Gln Ile Ala Ser Arg
195 200 205

His Tyr Ala Phe Glu Gly Arg Cys Phe Val Leu Ala Ala Gly Leu Ile
210 215 220

Met Pro Ala Phe Asp Leu Pro Ser Glu Leu Glu Phe Pro Pro Glu Leu
225 230 235 240

Ala Asp Lys Arg Asp Tyr Leu Leu Met Asn Gly Gly Ser Ala Ile Ile
245 250 255

Lys Pro Asn Gly Lys Tyr Leu Ala Gly Pro Val Tyr Asp Glu Glu Thr
260 265 270

Ile Leu Cys Ala Asp Leu Asp Leu Thr Glu Asn Ile Lys Glu Gln Met
275 280 285

Thr Leu Asp Val Thr Gly His Tyr Ala Arg Ala Glu Leu Phe Asp Leu
290 295 300

Asn Val Val Arg Arg Arg Asn Ala
305 310

<210> 27 
<211> 1056 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 27
atgccaaccc ccagcgatca tttcaaaatc gccgctgttc aggcctcgcc cgtgtttctg 60
gaccgggagg ccactgtgga aaaggcctgc cggttgatcg ccgaagccgc aaagcagggc 120
gcccgcctca tcgtctttcc ggaatctttc atcccgacct acccggattg ggtgtgggcc 180
gttcccccgg gaagggaaag aatcctgaac cagctgtatt ctgaattcct ggccaatgcc 240
gtcgatgttc ccggcgcggc gaccgaacaa cttgcccagg ctgcacgaat ggccggcgcc 300
tatgtgatta tgggcgtcac cgaaagagac acctcggcca gcggggccag cctctacaac 360
accctgctct acttcagccc cgaaggcatc ctaatgggca aacaccggaa gctggttccc 420
acggggggcg aacggctggt ctgggcctac ggagacggca gcacgctgga ggtctacgac 480
actccgctgg gaaagatcgg cgggctgatc tgctgggaga actacatgcc cctggcccgg 540
tacacgatgt acgcctgggg cacccagatt tacatcgccg ccacctggga ccgcggggaa 600
ccgtggctct ccaccctgcg gcatatcgcc aaggaaggaa gggtctacgt catcgggtgc 660
tgcatcgccc tgcgccaggg ggatatcccg gaccggttcg agtacaaggg aaaattttat 720
tccgggtccc gggagtggat caatgagggc gacagcgcca tcgtgaaccc ggacggggaa 780
ttcatcgccg ggccggtgcg gatgaaggag gagatcctgt atgccgagat agacccccgg 840
cagatgcggg gccccaagtg gatgctcgat gtggccggtc attacgcccg gccggatatc 900
ttcgagctca tcgtccaccg gaatccccac ccgatgatca aaatcgccga agacaggggc 960
acggggatcg cctcaagttt gattcgcccc cgccctaacc ttcccccatc aagggggagg 1020
aaatcggcaa gaagcaaacg caagcccaaa aaatga 1056



<210> 28 
<211> 351 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 28 





Pro 


i. LIS. 


Prn 
XT X \J 


Ser 


Asp 


His 


Phe 


Lys 


He 


Ala 


Ala 


Val 


Gin 


Ala 


Ser 










5 








10 










15 




Pro 


V d J_ 


Phe 


Leu 
20 


Asp 


Arg 


Glu 


Ala 


Thr 

25 


Val 


Glu 


Lvs 


Ala 


Cys 
30 


Arg 


Leu 


Tip 


Ala 


Glu 
35 


Ala 


Ala 


Lys 


Gin 


Glv 
40 


Ala 


Arg 


Leu 


He 


Val 
45 


Phe 


Pro 


Glu 


Ser 


Phe 
50 


He 


Pro 


Thr 


Tyr 


Pro 
55 


Asp 


Trp 


Val 


Trp 


Ala 
60 


Val 


Pro 


Pro 


Gly 


Arg 


Glu 


Arg 


He 


Leu 


Asn 


Gin 


Leu 


Tyr 


Ser 


Glu 


Phe 


Leu 


Ala 


Asn 


Ala 


65 








70 










75 










80 


Val 


Asp 


Val 


Pro 


Gly 


Ala 


Ala 


Thr 


Glu 


Gin 


Leu 


Ala 


Gin 


Ala 


Ala 


Arg 








85 










90 










95 




Met 


Ala 


Gly 


Ala 


Tyr 


Val 


He 


Met 


Gly 


Val 


Thr 


Glu 


Arg 


Asp 


Thr 


Ser 






100 










105 










110 






Ala 


Ser 


Gly 
115 


Ala 


Ser 


Leu 


Tyr 


Asn 
120 


Thr 


Leu 


Leu 


Tyr 


Phe 
125 


Ser 


Pro 


Glu 


Gly 


He 


Leu 


Met 


Gly 


Lys 


His 


Arg 


Lys 


Leu 


Val 


Pro 


Thr 


Gly 


Gly 


Glu 


130 










135 










140 










Arg 


Leu 


Val 


Trp 


Ala 


Tyr 


Gly 


Asp 


Gly 


Ser 


Thr 


Leu 


Glu 


Val 


Tyr 


Asp 


145 










150 










155 










160 


Thr 


Pro 


Leu 


Gly 


Lys 
165 


He 


Gly 


Gly 


Leu 


He 
170 


Cys 


Trp 


Glu 


Asn 


Tyr 
175 


Met 


Pro 


Leu 


Ala 


Arg 
180 


Tyr 


Thr 


Met 


Tyr 


Ala 
185 


Trp 


Gly 


Thr 


Gin 


He 
190 


Tyr 


He 


Ala 


Ala 


Thr 


Trp 


Asp 


Arg 


Gly 


Glu 


Pro 


Trp 


Leu 


Ser 


Thr 


Leu 


Arg 


His 






195 








200 










205 








He 


Ala 


Lys 


Glu 


Gly Arg 


Val 


Tyr 


Val 


He 


Gly 


Cys 


Cys 


He 


Ala 


Leu 




210 








215 










220 










Arg 


Gin 


Gly 


Asp 


He 


Pro 


Asp 


Arg 


Phe 


Glu 


Tyr 


Lys 


Gly 


Lys 


Phe 


Tyr 


225 










230 










235 










240 


Ser 


Gly 


Ser 


Arg 


Glu 


Trp 


He 


Asn 


Glu 


Gly Asp 


Ser 


Ala 


He 


Val 


Asn 








245 










250 










255 




Pro 


Asp 


Gly 


Glu 


Phe 


He 


Ala 


Gly 


Pro 


Val 


Arg 


Met 


Lys 


Glu 


Glu 


He 




260 










265 










270 






Leu 


Tyr 


Ala 


Glu 


He 


Asp 


Pro 


Arg 


Gin 


Met 


Arg 


Gly 


Pro 


Lys 


Trp 


Met 




275 










280 










285 








Leu 


Asp 
290 


Val 


Ala 


Gly 


His 


Tyr 
295 


Ala 


Arg 


Pro 


Asp 


He 
300 


Phe 


Glu 


Leu 


He 
Val His Arg Asn Pro His Pro Met Ile Lys Ile Ala Glu Asp Arg Gly
305 310 315 320

Thr Gly Ile Ala Ser Ser Leu Ile Arg Pro Arg Pro Asn Leu Pro Pro

325 330 335 

Ser Arg Gly Arg Lys Ser Ala Arg Ser Lys Arg Lys Pro Lys Lys 

340 345 350 



<210> 29 
<211> 1017 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 29 

atgggcatcg aacatacgaa atacaaggtg gcggtggtgc aggcggcgcc ggcctggctc 60
gacctcgagg cctcgatcgg caagtccatc ggcctaatca aggaggccgc ggacaagggc 120
gccaagctga tcgcctttcc ggaggccttc atccccggtt acccctggta tatctggatg 180
gactcgccgg cctgggcgat cggccgcggc ttcgtccagc gctatttcga caattcgctc 240
tcctacgaca gtccccaggc cgagcggctg cgtgatgccg tgcgccaggc caagctcacc 300
gccgtgatcg gcctgtccga acgcgacggc ggcagccttt acctggcgca atggttgatc 360
gggcccgacg gcgaaaccat tgccaagcgc cgcaagctgc ggccgaccca tgccgagcgc 420
accgtctatg gcgaaggcga cggcagcgat ctggccgtac atgcccggcc cgacatcggt 480
cgcttgggcg cgctgtgctg ctgggagcat cttcagccgt tgtcgaagta cgcaatgtac 540
gcccagaacg agcaggtcca cgtcgctgcc tggccgagct tctcgctcta cgatcccttc 600
gccccggcgc tcggcgccga ggtcaacaac gctgcctcgc gcgtctatgc ggtggagggc 660
tcctgcttcg tgctcgcgcc ttgcgcgacg gtgtcgcagg ccatgatcga cgaactctgc 720
gatcggcccg ataagcatgc gctgctgcat gccggcggag gctttgccgc gatctacggc 780
cccgacggca gccagatcgg cgagaagctg gcgccggatc aggagggtct gctgatcgcc 840
gagattgatc tgggcgccat cggtgttgcc aagaacgcgg cagatcccgc cggtcattat 900
tcacggccgg atgtgacgcg gttgctgctc aacaagaagc ggtaccagcg cgtcgagcaa 960
tttgccttgc ccgccgacat ggtcgagccc gcggacatag gcgcggcggc gagctga 1017



<210> 30 
<211> 338 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 30 



Met Gly Ile Glu His Thr Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15

Pro Ala Trp Leu Asp Leu Glu Ala Ser Ile Gly Lys Ser Ile Gly Leu
20 25 30

Ile Lys Glu Ala Ala Asp Lys Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45

Ala Phe Ile Pro Gly Tyr Pro Trp Tyr Ile Trp Met Asp Ser Pro Ala
50 55 60

Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80

Ser Tyr Asp Ser Pro Gln Ala Glu Arg Leu Arg Asp Ala Val Arg Gln
85 90 95

Ala Lys Leu Thr Ala Val Ile Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110

Leu Tyr Leu Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125

Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly
130 135 140

Glu Gly Asp Gly Ser Asp Leu Ala Val His Ala Arg Pro Asp Ile Gly
145 150 155 160

Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175

Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
180 185 190

Ser Phe Ser Leu Tyr Asp Pro Phe Ala Pro Ala Leu Gly Ala Glu Val
195 200 205

Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220

Leu Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225 230 235 240

Asp Arg Pro Asp Lys His Ala Leu Leu His Ala Gly Gly Gly Phe Ala
245 250 255

Ala Ile Tyr Gly Pro Asp Gly Ser Gln Ile Gly Glu Lys Leu Ala Pro
260 265 270

Asp Gln Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Ala Ile Gly
275 280 285

Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300

Val Thr Arg Leu Leu Leu Asn Lys Lys Arg Tyr Gln Arg Val Glu Gln
305 310 315 320

Phe Ala Leu Pro Ala Asp Met Val Glu Pro Ala Asp Ile Gly Ala Ala
325 330 335

Ala Ser



<210> 31 
<211> 933 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 31 

atgaccagaa tagccattat tcagcgaccg cccgtgctgc tcgatcgaag cgccaccatt 60
gcccgggccg tgcaatcggt cgccgaagcg gcagcgcaag gcgcgaccct gattgtcttg 120
cccgaatcgt acatccctgg ctatccctca tggatctggc ggctcgcgcc tggcaaagac 180
ggcgcgatcg tgggccagtt gcatgcgcgc ttgctggcca atgcggtcga cctgagcagc 240
actgacctcg atgcgcttct tgaagcggcc cgtcagcacg gcgtgaccat tgtttgcggc 300
atgaacgagt gcgaacggcg tcgcggcggc ggcaccttgt acaacacggt ggtcgtgatc 360
ggaccggacg gcgtcatgct caaccggcat cgcaaattga tgccgaccaa tcccgagcgc 420
atggtgcatg gctttggcga tgcatccgga ctgaaagcag ttgatacgcc tgccggccgg 480
ctgggcacgc tgatctgctg ggagagctac atgccgctgg cacgctatgc cctgtacgag 540
caaggcatcg agatctacat cgcaccaact tatgacagtg gtgacggctg gatcagcacc 600
atgcgccaca ttgcactcga agggcgctgc tgggtgattg gcagcggcac ggtcctgaaa 660
ggcagtgata ttccggacga tttcccggaa cgggcacgcc tgttccctga tccggatgag 720
tggatcaacg atggtgattc ggtagttatc gatccgcagg gaaagatcgt tgccggtccg 780
atgcgtaggg aagcaggcat tctatacgcc gatatcgacg tcgcgcgcgt agcaccatca 840
cgccgcacgc tggatgtcgc ggggcattac gcgcgtccgg acgtcttcga gcttcgggta 900
caccaggcac cgggggcacg agtaagtaat tga 933

<210> 32
<211> 310
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 32

Met Thr Arg Ile Ala Ile Ile Gln Arg Pro Pro Val Leu Leu Asp Arg
1 5 10 15

Ser Ala Thr Ile Ala Arg Ala Val Gln Ser Val Ala Glu Ala Ala Ala
20 25 30

Gln Gly Ala Thr Leu Ile Val Leu Pro Glu Ser Tyr Ile Pro Gly Tyr
35 40 45

Pro Ser Trp Ile Trp Arg Leu Ala Pro Gly Lys Asp Gly Ala Ile Val
50 55 60

Gly Gln Leu His Ala Arg Leu Leu Ala Asn Ala Val Asp Leu Ser Ser
65 70 75 80

Thr Asp Leu Asp Ala Leu Leu Glu Ala Ala Arg Gln His Gly Val Thr
85 90 95

Ile Val Cys Gly Met Asn Glu Cys Glu Arg Arg Arg Gly Gly Gly Thr
100 105 110

Leu Tyr Asn Thr Val Val Val Ile Gly Pro Asp Gly Val Met Leu Asn
115 120 125

Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val His Gly
130 135 140

Phe Gly Asp Ala Ser Gly Leu Lys Ala Val Asp Thr Pro Ala Gly Arg
145 150 155 160

Leu Gly Thr Leu Ile Cys Trp Glu Ser Tyr Met Pro Leu Ala Arg Tyr
165 170 175

Ala Leu Tyr Glu Gln Gly Ile Glu Ile Tyr Ile Ala Pro Thr Tyr Asp
180 185 190

Ser Gly Asp Gly Trp Ile Ser Thr Met Arg His Ile Ala Leu Glu Gly
195 200 205

Arg Cys Trp Val Ile Gly Ser Gly Thr Val Leu Lys Gly Ser Asp Ile
210 215 220

Pro Asp Asp Phe Pro Glu Arg Ala Arg Leu Phe Pro Asp Pro Asp Glu
225 230 235 240

Trp Ile Asn Asp Gly Asp Ser Val Val Ile Asp Pro Gln Gly Lys Ile
245 250 255

Val Ala Gly Pro Met Arg Arg Glu Ala Gly Ile Leu Tyr Ala Asp Ile
260 265 270

Asp Val Ala Arg Val Ala Pro Ser Arg Arg Thr Leu Asp Val Ala Gly
275 280 285

His Tyr Ala Arg Pro Asp Val Phe Glu Leu Arg Val His Gln Ala Pro
290 295 300

Gly Ala Arg Val Ser Asn
305 310

<210> 33 
<211> 1026 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 33

at-g£aagtc ccgtgacgca gtatcgcgcc gccgcggtgc aggcggcgcc atcttttctc 60
gatctcgacc gcaccgtcga gaagacgatc gcgatcatcg agcaggcggc cgagcaggat 120
gtgcgcctga tcgcgtttcc ggaaacctgg attcccggct atccgctctg gatctggctc 180
ggctcgccgg cctggggcat gcgcttcgtg cagcgctatt tcgagaactc ^tggtgcgc 240
ggcagcaaac agtggaacgc gatcgccgat gcggcgcggc gccaccgcat gaccgtcgtc 300
gtcggcttca gcgagcgcgc gggaggcagc ctctacatgg gccaggcgat cttcggcccc 360
gaaggcgagc tcatcgcggc gcgccggaag ctcaagccga cacacgccga gcgaacggtg 420
ttcggcgagg gcgacggcag ccacttggcc gtttacgaga cgggcgttgg tcgcatcggc 480
ccc?tgc? gctgggagca catccagccg ctctcgaaat acgcgatgta tgcggccaac 540
gaacaggtgc atgtggcctc gtggccgtgc ttcagccttt atcgcggcat ggcctatgcg 600
ctcgggccgg aggtgaacac cgccgcgagc caggtctacg cggtcgaggg cggctgctac 660
gtgctggcct cctgtctcgt cgtgacaccc gagatcctga aggtgctgat cgacacgccc 720
gacaaggagc cgttgctgct cgccggcggg gggttctcga tgatcttcgg ccccgacggc 780
cgcgcgctcg cccagccgct gccggagacc gaagaggggc tcgtcacggc cgagatcgat 840
ctcggcgcga tcgcgctcgc caaggccgcg gccgatcccg ccggccatta cgcgcggccc 900
gacgtgacgc ggttgttgct gaacccgcgc cccgcggcgc gcgtcgaagc gctgggtccg 960
cgcttcgagg tcgtgcagag cgagcaggcc gagccgccca cgcaaccggc cgaagcggcg 1020
gattga 1026

<210> 34 
<211> 341 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 34 

Met Leu Ser Pro Val Thr Gln Tyr Arg Ala Ala Ala Val Gln Ala Ala
1 5 10 15

Pro Ser Phe Leu Asp Leu Asp Arg Thr Val Glu Lys Thr Ile Ala Ile
20 25 30

Ile Glu Gln Ala Ala Glu Gln Asp Val Arg Leu Ile Ala Phe Pro Glu
35 40 45

Thr Trp Ile Pro Gly Tyr Pro Leu Trp Ile Trp Leu Gly Ser Pro Ala
50 55 60

Trp Gly Met Arg Phe Val Gln Arg Tyr Phe Glu Asn Ser Leu Val Arg
65 70 75 80

Gly Ser Lys Gln Trp Asn Ala Ile Ala Asp Ala Ala Arg Arg His Arg
85 90 95

Met Thr Val Val Val Gly Phe Ser Glu Arg Ala Gly Gly Ser Leu Tyr
100 105 110

Met Gly Gln Ala Ile Phe Gly Pro Glu Gly Glu Leu Ile Ala Ala Arg
115 120 125

Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Val Phe Gly Glu Gly
130 135 140

Asp Gly Ser His Leu Ala Val Tyr Glu Thr Gly Val Gly Arg Ile Gly
145 150 155 160

Ala Leu Cys Cys Trp Glu His Ile Gln Pro Leu Ser Lys Tyr Ala Met
165 170 175

Tyr Ala Ala Asn Glu Gln Val His Val Ala Ser Trp Pro Cys Phe Ser
180 185 190

Leu Tyr Arg Gly Met Ala Tyr Ala Leu Gly Pro Glu Val Asn Thr Ala
195 200 205

Ala Ser Gln Val Tyr Ala Val Glu Gly Gly Cys Tyr Val Leu Ala Ser
210 215 220

Cys Leu Val Val Thr Pro Glu Ile Leu Lys Val Leu Ile Asp Thr Pro
225 230 235 240

Asp Lys Glu Pro Leu Leu Leu Ala Gly Gly Gly Phe Ser Met Ile Phe
245 250 255

Gly Pro Asp Gly Arg Ala Leu Ala Gln Pro Leu Pro Glu Thr Glu Glu
260 265 270

Gly Leu Val Thr Ala Glu Ile Asp Leu Gly Ala Ile Ala Leu Ala Lys
275 280 285

Ala Ala Ala Asp Pro Ala Gly His Tyr Ala Arg Pro Asp Val Thr Arg
290 295 300

Leu Leu Leu Asn Pro Arg Pro Ala Ala Arg Val Glu Ala Leu Gly Pro
305 310 315 320

Arg Phe Glu Val Val Gln Ser Glu Gln Ala Glu Pro Pro Thr Gln Pro
325 330 335

Ala Glu Ala Ala Asp
340

<210> 35 
<211> 942 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample



<400> 35 

atgacagctc taaaaattgc tgctgttcaa atgtgcgccg aattgggcgc tacagatcga 60
aacctgagtg cagctggatc attcgtgcgc gacgcatttc gcgaaggtgc ccagtgggta 120
atcctcccag agttttttac ctcggcaatg gcattcgcac cttcgatggc gcaagcttgg 180
ttgccactgg aaggaaaggc gctagcgatg atgcgcagcc ttgcgcgtca attcgatggg 240
gttgttggag gctcatatgt tgccagagag gggaacgact gcgtaaatgc ctttcttctc 300
gtctttccgg atggaagcta ctaccggcat gacaaagata ttccaacaat gtgggagaac 360
tgttactaca tcggcggcgt cgacgatggg gtgctggaaa caccaattgg tgcggtggga 420
gttgcactgt gttgggagtt catccgaaca caaaccgccc gaagactgaa ggatcgcgtt 480
caattagtgg ttggcggtac ttgctggtgg gattttccga tgcctgtacc tgaacgatat 540
ctgaggctga ccaggcatat ctccaggaac tttgagcgcg atgctccggc gcggttggcc 600
agtatgttgg gtgtgcctgt tgtacacgct tcccatgctg gggattttac tgctgtcacc 660
ccaggcaatg aaacgaagaa ttaccgatcc aactatctgg gagagaccca gatcgtcgat 720
gccaatggaa atgtgttgaa gcgaatgaca gtggctgatg gtgagggtta cgtcattgct 780
gacgttcaat tgggggccat atcaaccggt cgaacttcga tccccgacac cttctggacc 840
tgcaagctaa cgccaggggc acaacaggct tgggatgaac aaaatgcttt tgggtgtggc 900
tactatgaga acgtcacacg caaacaccta atcggtcgat ga 942



<210> 36 
<211> 313 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 36 



Met Thr Ala Leu Lys Ile Ala Ala Val Gln Met Cys Ala Glu Leu Gly
1 5 10 15

Ala Thr Asp Arg Asn Leu Ser Ala Ala Gly Ser Phe Val Arg Asp Ala
20 25 30

Phe Arg Glu Gly Ala Gln Trp Val Ile Leu Pro Glu Phe Phe Thr Ser
35 40 45

Ala Met Ala Phe Ala Pro Ser Met Ala Gln Ala Trp Leu Pro Leu Glu
50 55 60

Gly Lys Ala Leu Ala Met Met Arg Ser Leu Ala Arg Gln Phe Asp Gly
65 70 75 80

Val Val Gly Gly Ser Tyr Val Ala Arg Glu Gly Asn Asp Cys Val Asn
85 90 95

Ala Phe Leu Leu Val Phe Pro Asp Gly Ser Tyr Tyr Arg His Asp Lys
100 105 110

Asp Ile Pro Thr Met Trp Glu Asn Cys Tyr Tyr Ile Gly Gly Val Asp
115 120 125

Asp Gly Val Leu Glu Thr Pro Ile Gly Ala Val Gly Val Ala Leu Cys
130 135 140

Trp Glu Phe Ile Arg Thr Gln Thr Ala Arg Arg Leu Lys Asp Arg Val
145 150 155 160

Gln Leu Val Val Gly Gly Thr Cys Trp Trp Asp Phe Pro Met Pro Val
165 170 175

Pro Glu Arg Tyr Leu Arg Leu Thr Arg His Ile Ser Arg Asn Phe Glu
180 185 190

Arg Asp Ala Pro Ala Arg Leu Ala Ser Met Leu Gly Val Pro Val Val
195 200 205

His Ala Ser His Ala Gly Asp Phe Thr Ala Val Thr Pro Gly Asn Glu
210 215 220

Thr Lys Asn Tyr Arg Ser Asn Tyr Leu Gly Glu Thr Gln Ile Val Asp
225 230 235 240

Ala Asn Gly Asn Val Leu Lys Arg Met Thr Val Ala Asp Gly Glu Gly
245 250 255

Tyr Val Ile Ala Asp Val Gln Leu Gly Ala Ile Ser Thr Gly Arg Thr
260 265 270

Ser Ile Pro Asp Thr Phe Trp Thr Cys Lys Leu Thr Pro Gly Ala Gln
275 280 285

Gln Ala Trp Asp Glu Gln Asn Ala Phe Gly Cys Gly Tyr Tyr Glu Asn
290 295 300

Val Thr Arg Lys His Leu Ile Gly Arg
305 310





















<210> 37 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 37 

atggattcag atatgacgga tacttttaag gcagccatta ttcagcacgc gcctgttttt 60
ttaaatctcg aagagagtct ggacaaagcc ggcagcctta tagaaaaggc tgccgatcaa 120
ggcgcgaaag tgatcgcctt tcctgaaaca tggctgcccg gttatcccgt atggctcgac 180
tactctccaa aagcgggtct gtgggactat cagcctgcaa aatctctcta tcgtctgcta 240
gtcgataatt cagtcacctt acccggcaaa cacctcgatc aactcctctc catagcgcaa 300
aagaccggcg catatgttgt aatgggggca cacgaacgag tgggtggaac actctataac 360
acgacgatct atgttgggat tgatgggaag gagtacaaac ttcatagaaa gctggtgccg 420
acctataccg aaagattgat ctgggggcgg ggagacggca gcacattgag tgtgttgatg 480
acggattatg gcgttcttgg aggattgatc tgctgggagc actggatgcc tctggcaaga 540
gccgcaatgc atgccagata tgaaaccctt catgtggcgc aatggccggc tgtaaaagat 600
atccatcaga tagcaagcag acattatgct tttgaaggcc ggtgtttcgt gctcgcggca 660
ggctctgttc tgactcgaag agatataata gaaggattca actcactggc tcgcgccgat 720
agtgatgcat tggaacttct gaaagctatt tcgggagaag atagtgatct tattttgaat 780
gggggaagcg cgataattgc gccgaatgga gagtatcttg cgggcccggt ctttaatgaa 840
ccctccatta tttatgctga aattgatcct gcactgataa gtgagggcca tcttacactg 900
gatacaagcg gacactactc gcgccctgac atttttcgtc tggagataaa cgatcaacct 960
caacatgatg taactttcag atcggggcat tag 993



<210> 38 
<211> 330 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
<400> 38 

Met Asp Ser Asp Met Thr Asp Thr Phe Lys Ala Ala Ile Ile Gln His
1 5 10 15

Ala Pro Val Phe Leu Asn Leu Glu Glu Ser Leu Asp Lys Ala Gly Ser
20 25 30

Leu Ile Glu Lys Ala Ala Asp Gln Gly Ala Lys Val Ile Ala Phe Pro
35 40 45

Glu Thr Trp Leu Pro Gly Tyr Pro Val Trp Leu Asp Tyr Ser Pro Lys
50 55 60

Ala Gly Leu Trp Asp Tyr Gln Pro Ala Lys Ser Leu Tyr Arg Leu Leu
65 70 75 80

Val Asp Asn Ser Val Thr Leu Pro Gly Lys His Leu Asp Gln Leu Leu
85 90 95

Ser Ile Ala Gln Lys Thr Gly Ala Tyr Val Val Met Gly Ala His Glu
100 105 110

Arg Val Gly Gly Thr Leu Tyr Asn Thr Thr Ile Tyr Val Gly Ile Asp
115 120 125

Gly Lys Glu Tyr Lys Leu His Arg Lys Leu Val Pro Thr Tyr Thr Glu
130 135 140

Arg Leu Ile Trp Gly Arg Gly Asp Gly Ser Thr Leu Ser Val Leu Met
145 150 155 160

Thr Asp Tyr Gly Val Leu Gly Gly Leu Ile Cys Trp Glu His Trp Met
165 170 175

Pro Leu Ala Arg Ala Ala Met His Ala Arg Tyr Glu Thr Leu His Val
180 185 190

Ala Gln Trp Pro Ala Val Lys Asp Ile His Gln Ile Ala Ser Arg His
195 200 205

Tyr Ala Phe Glu Gly Arg Cys Phe Val Leu Ala Ala Gly Ser Val Leu
210 215 220

Thr Arg Arg Asp Ile Ile Glu Gly Phe Asn Ser Leu Ala Arg Ala Asp
225 230 235 240

Ser Asp Ala Leu Glu Leu Leu Lys Ala Ile Ser Gly Glu Asp Ser Asp
245 250 255

Leu Ile Leu Asn Gly Gly Ser Ala Ile Ile Ala Pro Asn Gly Glu Tyr
260 265 270

Leu Ala Gly Pro Val Phe Asn Glu Pro Ser Ile Ile Tyr Ala Glu Ile
275 280 285

Asp Pro Ala Leu Ile Ser Glu Gly His Leu Thr Leu Asp Thr Ser Gly
290 295 300

His Tyr Ser Arg Pro Asp Ile Phe Arg Leu Glu Ile Asn Asp Gln Pro
305 310 315 320

Gln His Asp Val Thr Phe Arg Ser Gly His
325 330

<210> 39 
<211> 1008 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 39 

atgaaaaata tcaaaaactc agaaaaaagc agcacagtaa gagtcgctgc ggtacaaatc 60
agtccggtgt tgtacaaccg cgaagctacc gttcaaaaag tagtcaacaa aatccttgaa 120
ctaggaaaac aaggggtaca attcgccact tttccggaaa cgatagtgcc ttattatcct 180
tatttctctt ttattcaggc gccttatgcc atgggcaaag aacacctgcg cttgcttgaa 240
caatcagtta ctgttccgtc agccgcgacc gatgccataa gtgaggcggc aaaggaagcc 300
aatatggtag tgtctattgg tgtcaatgaa cgagacggtg gtaccattta caatacgcaa 360
ctcctttttg atgctgacgg aacattaatt cagcgcagac gtaaacttac accaacgtat 420
catgaaagaa tgatttgggg acaaggtgac gcttcaggtc ttcgtgccac agacagcgct 480
gttgggcgta tcgggcagtt ggcttgttgg gaacattaca atccattgtt ccgttatgct 540
ttgattgctg atggagaaca aatccattct gccatgtatc ccggatcatt tttaggtgcg 600
ttgcacggtg aacaaaccga aatcaatgta cgccaacacg ctttagaatc ggccagcttc 660
gtcgtagtgg ctaccggttg gttggatgcc gatcaacaag cacaaattgc gaaagacacc 720
ggtggaccaa tcggaccaat ttcgggaggt tgttttacag ccgttatagg ccctgacgga 780
caactaatcg gggaagccct tacatcaggt gaaggggaag tgattgccga tattgatttg 840
gcacaaattg atgcccgcaa aagattaatg gatgccagtg gtcactacaa ccgtcctgaa 900
ttgttgagct tgcatatcga tcacactccg actgctccta tgcatgaaag agtagtttac 960
actgagccgg gattagcaaa aagacaaaat gaaaattcat caaattaa 1008

<210> 40
<211> 335 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 40 

Met Lys Asn Ile Lys Asn Ser Glu Lys Ser Ser Thr Val Arg Val Ala
1 5 10 15

Ala Val Gln Ile Ser Pro Val Leu Tyr Asn Arg Glu Ala Thr Val Gln
20 25 30

Lys Val Val Asn Lys Ile Leu Glu Leu Gly Lys Gln Gly Val Gln Phe
35 40 45

Ala Thr Phe Pro Glu Thr Ile Val Pro Tyr Tyr Pro Tyr Phe Ser Phe
50 55 60

Ile Gln Ala Pro Tyr Ala Met Gly Lys Glu His Leu Arg Leu Leu Glu
65 70 75 80

Gln Ser Val Thr Val Pro Ser Ala Ala Thr Asp Ala Ile Ser Glu Ala
85 90 95

Ala Lys Glu Ala Asn Met Val Val Ser Ile Gly Val Asn Glu Arg Asp
100 105 110

Gly Gly Thr Ile Tyr Asn Thr Gln Leu Leu Phe Asp Ala Asp Gly Thr
115 120 125

Leu Ile Gln Arg Arg Arg Lys Leu Thr Pro Thr Tyr His Glu Arg Met
130 135 140

Ile Trp Gly Gln Gly Asp Ala Ser Gly Leu Arg Ala Thr Asp Ser Ala
145 150 155 160

Val Gly Arg Ile Gly Gln Leu Ala Cys Trp Glu His Tyr Asn Pro Leu
165 170 175

Phe Arg Tyr Ala Leu Ile Ala Asp Gly Glu Gln Ile His Ser Ala Met
180 185 190

Tyr Pro Gly Ser Phe Leu Gly Ala Leu His Gly Glu Gln Thr Glu Ile
195 200 205

Asn Val Arg Gln His Ala Leu Glu Ser Ala Ser Phe Val Val Val Ala
210 215 220

Thr Gly Trp Leu Asp Ala Asp Gln Gln Ala Gln Ile Ala Lys Asp Thr
225 230 235 240

Gly Gly Pro Ile Gly Pro Ile Ser Gly Gly Cys Phe Thr Ala Val Ile
245 250 255

Gly Pro Asp Gly Gln Leu Ile Gly Glu Ala Leu Thr Ser Gly Glu Gly
260 265 270

Glu Val Ile Ala Asp Ile Asp Leu Ala Gln Ile Asp Ala Arg Lys Arg
275 280 285

Leu Met Asp Ala Ser Gly His Tyr Asn Arg Pro Glu Leu Leu Ser Leu
290 295 300

His Ile Asp His Thr Pro Thr Ala Pro Met His Glu Arg Val Val Tyr
305 310 315 320

Thr Glu Pro Gly Leu Ala Lys Arg Gln Asn Glu Asn Ser Ser Asn
325 330 335



<210> 41 
<211> 966 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
<400> 41

atgcccaatg agaataacat cgctacattc aaagttgccg cagtccaggc cacaccggtg 60
tttcttgatc gtgaagcaac catcgacaaa gcttgcgggt tgattgccac tgccggcaat 120
gaaggagcgc gcctgattgt gtttccagaa gcgttcatcc caacctatcc tgaatgggtt 180
tggggtattc cttccggtga gcaaggttta ctcaatgaac tctatgcaga gctgctcacc 240
aatgcggtca ctattcccag tgacgcgact gacaggctgt gcgaggccgc gcagcttgcg 300
aatgcctacg tagtgatggg aatgagcgaa cggaatgtcg aggcgagtgg cgcaagccta 360
tataatacgc tgttgtacat caatgcgcag ggggagattt tagggaaaca tcgaaagctg 420
gtgccaacgg gcggcgaacg cctggtatgg gcgcagggtg atggcagcac gctgcaggtc 480
tacgatacgc cattgggaaa actcggtggt ctcatttgct gggaaaatta tatgccgctg 540
gcacgctatg ctatgtatgc ctgggggaca caaatctatg tcgcggcaac gtgggatcga 600
ggccaaccct ggctttctac attacggcat atcgccaaag aaggcagggt atacgtgatt 660
ggttgctgta tcgcgatgcg aaaagacgat attccggata gttactccat gaagcagaaa 720
taccatgctg aaatggatga atggattaat gttggcgaca gtgtgattgt caatcccgaa 780
ggacacttta tcgcagggcc tgtgcgcaag caagaagaaa ttctctacgc ggagatcgat 840
ccacgtatgg tgcaaggccc gaagtggatg ctcgatgtgg cggggcatta tgcgagacca 900
gatgtgttcc agttgacggt gcatacgaat gtgagagaga tgatgcgggt ggaagatgat 960
tcataa 966

<210> 42
<211> 321
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 42

Met Pro Asn Glu Asn Asn Ile Ala Thr Phe Lys Val Ala Ala Val Gln
1 5 10 15

Ala Thr Pro Val Phe Leu Asp Arg Glu Ala Thr Ile Asp Lys Ala Cys
20 25 30

Gly Leu Ile Ala Thr Ala Gly Asn Glu Gly Ala Arg Leu Ile Val Phe
35 40 45

Pro Glu Ala Phe Ile Pro Thr Tyr Pro Glu Trp Val Trp Gly Ile Pro
50 55 60

Ser Gly Glu Gln Gly Leu Leu Asn Glu Leu Tyr Ala Glu Leu Leu Thr
65 70 75 80

Asn Ala Val Thr Ile Pro Ser Asp Ala Thr Asp Arg Leu Cys Glu Ala
85 90 95

Ala Gln Leu Ala Asn Ala Tyr Val Val Met Gly Met Ser Glu Arg Asn
100 105 110

Val Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Ile Asn
115 120 125

Ala Gln Gly Glu Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly
130 135 140

Gly Glu Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln Val
145 150 155 160

Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn
165 170 175

Tyr Met Pro Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Thr Gln Ile
180 185 190

Tyr Val Ala Ala Thr Trp Asp Arg Gly Gln Pro Trp Leu Ser Thr Leu
195 200 205

Arg His Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile
210 215 220

Ala Met Arg Lys Asp Asp Ile Pro Asp Ser Tyr Ser Met Lys Gln Lys
225 230 235 240

Tyr His Ala Glu Met Asp Glu Trp Ile Asn Val Gly Asp Ser Val Ile
245 250 255

Val Asn Pro Glu Gly His Phe Ile Ala Gly Pro Val Arg Lys Gln Glu
260 265 270

Glu Ile Leu Tyr Ala Glu Ile Asp Pro Arg Met Val Gln Gly Pro Lys
275 280 285

Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gln
290 295 300

Leu Thr Val His Thr Asn Val Arg Glu Met Met Arg Val Glu Asp Asp
305 310 315 320

Ser



<210> 43 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 43 

atgagagtcg ttaaagccgc ggcggtccaa ctgaaacctg tcctctatag ccgagaggga 60
actgtcgaaa acgtcgtccg taaaatccac gagcttggac agcaaggggt acagttcgcg 120
acgtttccag agactgtggt gccttactac ccgtactttt cgatcgtgca gtccggctat 180
caaattcttg gcggtggtga gttcctgaag ctgcttgatc agtcagtaac cgtgccatct 240
ctcgctacgg aagcgatcgg cgaggcttgc aggcaggcgg gcgtcgttgt ctccatcggc 300
gtcaacgagc gtgatggagg aactctatac aacacgcaac ttctctttga tgccgacgga 360
acattgattc aaagacgacg caagatcaca cccacccatt acgagcgcat ggtctggggc 420
cagggcgatg gctcaggttt acgggccatt gacagcaagg tcgcgcgcat tggtcaactg 480
gcgtgttttg agcactacaa ccctctcgca cgttacgcga tgatggccga tggcgagcag 540
atccattctg cgatgttccc cggctccatg ttcggcgata atttttcaga gaaggtggaa 600
atcaacataa ggcagcatgc aatggagtct gggtgctttg tcgtttgcgc tactgcctgg 660
ttggatgctg accagcaggc tcaaatcatg aaagacacgg gatgtgagat cggaccgatc 720
tcaggaggtt gcttcacagc gatcgcggca ccagatggaa gccttatagg tgaacccatc 780
cgctcaggtg aaggcgtttg tattgccgac ctcgatttca aacttatcga caagcggaag 840
cacgtagtag acacacgcgg ccattatagc cggccagaat tgctcagcct cctgattgat 900
cggacgccga cggcccacat acacgaaagg accgagcaac cgagggcggc catcgagaaa 960
gagtcgcagg atgttttcac cgctgttgct taa 993



<210> 44 
<211> 330 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 44 



Met Arg Val Val Lys Ala Ala Ala Val Gln Leu Lys Pro Val Leu Tyr
1 5 10 15

Ser Arg Glu Gly Thr Val Glu Asn Val Val Arg Lys Ile His Glu Leu
20 25 30

Gly Gln Gln Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Ile Val Gln Ser Gly Tyr Gln Ile Leu Gly
50 55 60

Gly Gly Glu Phe Leu Lys Leu Leu Asp Gln Ser Val Thr Val Pro Ser
65 70 75 80

Leu Ala Thr Glu Ala Ile Gly Glu Ala Cys Arg Gln Ala Gly Val Val
85 90 95

Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr
100 105 110

Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125

Ile Thr Pro Thr His Tyr Glu Arg Met Val Trp Gly Gln Gly Asp Gly
130 135 140

Ser Gly Leu Arg Ala Ile Asp Ser Lys Val Ala Arg Ile Gly Gln Leu
145 150 155 160

Ala Cys Phe Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Met Met Ala
165 170 175

Asp Gly Glu Gln Ile His Ser Ala Met Phe Pro Gly Ser Met Phe Gly
180 185 190

Asp Asn Phe Ser Glu Lys Val Glu Ile Asn Ile Arg Gln His Ala Met
195 200 205

Glu Ser Gly Cys Phe Val Val Cys Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220

Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Glu Ile Gly Pro Ile
225 230 235 240

Ser Gly Gly Cys Phe Thr Ala Ile Ala Ala Pro Asp Gly Ser Leu Ile
245 250 255

Gly Glu Pro Ile Arg Ser Gly Glu Gly Val Cys Ile Ala Asp Leu Asp
260 265 270

Phe Lys Leu Ile Asp Lys Arg Lys His Val Val Asp Thr Arg Gly His
275 280 285

Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Thr
290 295 300

Ala His Ile His Glu Arg Thr Glu Gln Pro Arg Ala Ala Ile Glu Lys
305 310 315 320

Glu Ser Gln Asp Val Phe Thr Ala Val Ala
325 330















<210> 45 
<211> 996 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 45

gtgaaaccgc cgacgtcatt ccgcgtggcc gccgttcagg cctgtcccgt ctacctcgat 60

cgcgacctga cgatcggcaa ggcagaaggg ttgatcgccg aggcggctgg aaacggcgcg 120 

aacctcatcg tgttccccga agcgttcgtg cctggctatc cggtgtgggt gtggttcatc 180 

ccgcccggtc gcacggcgga tcttcgcgaa gcgtacagcg tcctccacgc caactcgatc 240 

gcggtcccca gccagtcgac cgagcgtctg tgcgcggcgg cgcggcgcgc tggcgtcgcc 300 

gtggcgattg gtgtcaacga aagaaacagc gaggccagcg gcggcagcct cttcaacacg 360 

ctgctgtaca tcggaccgga cggcacgctg ctcggtaaac accgaaagct ggtgccaaca 420

ggcggagagc gtcttgtctg ggccagcggc gacggcagcg accttgccgt gttcacactg 480

cctttcgcgc gagtcggcgg actgatctgc tgggagaact acatgccgct cgcccgctac 540

gcgctggcgg cctggggtgc gcaaatccac gtggcgccga cctgggaccg cggcgagccg 600 

tggctctcaa cactgcgtca tgtcgcgaag gaaggtagag ccgtgacgat cggctgctgt 660 

caggccgtcc gcaaggaaga cattccggac gggctggcat tcaagtcccg atacctggcc 720 

gacgtgggcg cctgggtcaa cccaggcggg agcgtcatcg tcgatcccga cggaaaaatt 780

cttgccggac ctgcgaacga aaccgaaggc atcttgtacg ctgacatcag ggccgatcag 840

ctcgtcgggc cgagatggca actcgacatt gccggacact acgcgcggcc ggacgtcttc 900 

gagctgatcg tgcatcggcg ttcgacgccg atgattcgcg aggtctcggc gcctcgtcgt 960 

cgcgcaagaa cgggaaagcg accgcgacgc cgctga 996 

<210> 46 
<211> 331 
<212> PRT 
<213> Unknown 



<220> 
<223> Obtained from an environmental sample
<400> 46 

Val Lys Pro Pro Thr Ser Phe Arg Val Ala Ala Val Gln Ala Cys Pro
1 5 10 15

Val Tyr Leu Asp Arg Asp Leu Thr Ile Gly Lys Ala Glu Gly Leu Ile
20 25 30

Ala Glu Ala Ala Gly Asn Gly Ala Asn Leu Ile Val Phe Pro Glu Ala
35 40 45

Phe Val Pro Gly Tyr Pro Val Trp Val Trp Phe Ile Pro Pro Gly Arg
50 55 60

Thr Ala Asp Leu Arg Glu Ala Tyr Ser Val Leu His Ala Asn Ser Ile
65 70 75 80

Ala Val Pro Ser Gln Ser Thr Glu Arg Leu Cys Ala Ala Ala Arg Arg
85 90 95

Ala Gly Val Ala Val Ala Ile Gly Val Asn Glu Arg Asn Ser Glu Ala
100 105 110

Ser Gly Gly Ser Leu Phe Asn Thr Leu Leu Tyr Ile Gly Pro Asp Gly
115 120 125

Thr Leu Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu Arg
130 135 140

Leu Val Trp Ala Ser Gly Asp Gly Ser Asp Leu Ala Val Phe Thr Leu
145 150 155 160

Pro Phe Ala Arg Val Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met Pro
165 170 175

Leu Ala Arg Tyr Ala Leu Ala Ala Trp Gly Ala Gln Ile His Val Ala
180 185 190

Pro Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His Val
195 200 205

Ala Lys Glu Gly Arg Ala Val Thr Ile Gly Cys Cys Gln Ala Val Arg
210 215 220

Lys Glu Asp Ile Pro Asp Gly Leu Ala Phe Lys Ser Arg Tyr Leu Ala
225 230 235 240

Asp Val Gly Ala Trp Val Asn Pro Gly Gly Ser Val Ile Val Asp Pro
245 250 255

Asp Gly Lys Ile Leu Ala Gly Pro Ala Asn Glu Thr Glu Gly Ile Leu
260 265 270

Tyr Ala Asp Ile Arg Ala Asp Gln Leu Val Gly Pro Arg Trp Gln Leu
275 280 285

Asp Ile Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu Leu Ile Val
290 295 300

His Arg Arg Ser Thr Pro Met Ile Arg Glu Val Ser Ala Pro Arg Arg
305 310 315 320

Arg Ala Arg Thr Gly Lys Arg Pro Arg Arg Arg
325 330

<210> 47 
<211> 1014 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 47 

gtgaaagaag caatcaaagt agcctgtgtg caagcagctc cagtctttct cgacctggac 60

gccacagtgg acaagaccgt cgccctgatt gaggaggcag cccgtaacgg cgcacgccta 120

atcgcctttc cagagacctg gattccaggc tacccatggt tcctttggct ggactcacca 180

gcctggggga tgcaattcgt gcgccgatac cacgagaact cactggtcct cgacagccct 240

caggccaagc gcatcagtga ggccgcccag cgcgccggta tatacgtcgc gctagggtac 300

agcgaacgcg tgagcggaac cctctacatg gggcagtggc tcattgacga taagggcgaa 360

acagctgggc tgcgccgaaa gctgaaacca acccatgtag agcgaaccct cttcggtgaa 420

ggcgacggat catccctttc cactttcgac acaccgttgg gggtgctggg cggactctgc 480

tgttgggaac acttacaacc tctttcgaaa tatgcgctct acgcacagaa cgaggaaata 540

cacttcgccg cctggcctag cttcagcatc taccgtcaag cgacagaagt ccttggacca 600

gaagtaaatg tcgcagcttc tcggatctac gccgtggaag ggcagtgttt tgttctcgct 660

tcctgcgcgc tcgtctcgcc agagatgatc gaaatgctct gcactgacga aagcaagcac 720

agccttcttc aggccggcgg cgggtactcc cgcattatcg gtcccgatgg cagcgaccta 780

gcgcgcccct tgggcgaaaa cgaggaaggt attctctatg ccactctgga ccctgccgct 840

cgaatctatg caaagaccgc agctgatcca gccgggcact actccagacc agacgtcact 900

cggctgctga tcaatcgcag tgccaatcag ccagtcgtag aggttggaag ggaaatacct 960

gcatcggccc aaggctttga agttgaggcg gcccccgggt acgaaggcga ttga 1014



<210> 48 
<211> 337 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 48 



Val Lys Glu Ala Ile Lys Val Ala Cys Val Gln Ala Ala Pro Val Phe
1 5 10 15

Leu Asp Leu Asp Ala Thr Val Asp Lys Thr Val Ala Leu Ile Glu Glu
20 25 30

Ala Ala Arg Asn Gly Ala Arg Leu Ile Ala Phe Pro Glu Thr Trp Ile
35 40 45

Pro Gly Tyr Pro Trp Phe Leu Trp Leu Asp Ser Pro Ala Trp Gly Met
50 55 60

Gln Phe Val Arg Arg Tyr His Glu Asn Ser Leu Val Leu Asp Ser Pro
65 70 75 80

Gln Ala Lys Arg Ile Ser Glu Ala Ala Gln Arg Ala Gly Ile Tyr Val
85 90 95

Ala Leu Gly Tyr Ser Glu Arg Val Ser Gly Thr Leu Tyr Met Gly Gln
100 105 110

Trp Leu Ile Asp Asp Lys Gly Glu Thr Ala Gly Leu Arg Arg Lys Leu
115 120 125

Lys Pro Thr His Val Glu Arg Thr Leu Phe Gly Glu Gly Asp Gly Ser
130 135 140

Ser Leu Ser Thr Phe Asp Thr Pro Leu Gly Val Leu Gly Gly Leu Cys
145 150 155 160

Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr Ala Leu Tyr Ala Gln
165 170 175

Asn Glu Glu Ile His Phe Ala Ala Trp Pro Ser Phe Ser Ile Tyr Arg
180 185 190

Gln Ala Thr Glu Val Leu Gly Pro Glu Val Asn Val Ala Ala Ser Arg
195 200 205

Ile Tyr Ala Val Glu Gly Gln Cys Phe Val Leu Ala Ser Cys Ala Leu
210 215 220

Val Ser Pro Glu Met Ile Glu Met Leu Cys Thr Asp Glu Ser Lys His
225 230 235 240

Ser Leu Leu Gln Ala Gly Gly Gly Tyr Ser Arg Ile Ile Gly Pro Asp
245 250 255

Gly Ser Asp Leu Ala Arg Pro Leu Gly Glu Asn Glu Glu Gly Ile Leu
260 265 270

Tyr Ala Thr Leu Asp Pro Ala Ala Arg Ile Tyr Ala Lys Thr Ala Ala
275 280 285

Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val Thr Arg Leu Leu Ile
290 295 300

Asn Arg Ser Ala Asn Gln Pro Val Val Glu Val Gly Arg Glu Ile Pro
305 310 315 320

Ala Ser Ala Gln Gly Phe Glu Val Glu Ala Ala Pro Gly Tyr Glu Gly
325 330 335

Asp



<210> 49 
<211> 1038 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 49 

atgaacaaag tcgtggctgc tgccgttcag tgcagcccgg tgctttactc ttgcgccgga 60 

actgtaaata aaatttgcga gtggattgca gatttgggca aacaaggggt tgagctggcg 120

gtgttcgcgg aaaccctggt gccttactac ccgtattttt cttttatcca ggctccttgt 180 

gcgatgggcg cgcaacattt gttgttgatg caagaatcag tagaggttcc ttccatctac 240

acgcaacaaa ttgccgctgc agcaaaagca gcgaagatgg tggtgtcagt tggtattaac 300 

gaacgcgacg gcggttctat ttataacgcg caattattat ttgatgcggg cggtcagctt 360

gttcagcacc gccgaaaaat tacgccgaca tttcatgagc gcatggtgtg ggggcagggc 420 

gatggctccg gtttgtgcgc agtggatacg gcagttggtc gtgttggttc gctcgcttgc 480

tgggaacatt acaacccact cgcgcgttac gcattgatgg cagatcgcga acaaattcac 540 

gtgagtatgt ttcccggttc tttggtcggc gaaatttttg ccgagcaaat tgaagcaact 600 

attcgtcacc acgcattgga gtccggttgc tttgtggtaa atgcgacggg ctggttaacg 660

ccggaacagc aagctcaaat cgtaaaagat actggtggtc ctatcgctgc cattagcggt 720 

ggttgtttca ccgccattgt ttcaccggaa ggaaaattgc tcggcacgcc attgcgcagt 780

gattccgggg agggtgcctg tatcgccgaa ctggatttta atctcatcaa taagcgtaag 840 

cgcatgatgg attctgtcgg ccattacagt cgtcctgaat tgctcagttt gctgattgat 900 

aaaacaccga caagtcatac acatccgctt aaaaaacctt tggctcccag tgaaaaaaat 960

acgccagagg atatcgccac tggtttaaca ctggtcactc ccgtttcaaa tgcaaacctt 1020 

ttcagcgcaa gcaactag 1038 

<210> 50 
<211> 345 
<212> PRT 
<213> Unknown

<220> 

<223> Obtained from an environmental sample 



<400> 50 



Met Asn Lys Val Val Ala Ala Ala Val Gln Cys Ser Pro Val Leu Tyr
1 5 10 15

Ser Cys Ala Gly Thr Val Asn Lys Ile Cys Glu Trp Ile Ala Asp Leu
20 25 30

Gly Lys Gln Gly Val Glu Leu Ala Val Phe Ala Glu Thr Leu Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Phe Ile Gln Ala Pro Cys Ala Met Gly Ala
50 55 60

Gln His Leu Leu Leu Met Gln Glu Ser Val Glu Val Pro Ser Ile Tyr
65 70 75 80

Thr Gln Gln Ile Ala Ala Ala Ala Lys Ala Ala Lys Met Val Val Ser
85 90 95

Val Gly Ile Asn Glu Arg Asp Gly Gly Ser Ile Tyr Asn Ala Gln Leu
100 105 110

Leu Phe Asp Ala Gly Gly Gln Leu Val Gln His Arg Arg Lys Ile Thr
115 120 125

Pro Thr Phe His Glu Arg Met Val Trp Gly Gln Gly Asp Gly Ser Gly
130 135 140

Leu Cys Ala Val Asp Thr Ala Val Gly Arg Val Gly Ser Leu Ala Cys
145 150 155 160

Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Asp Arg
165 170 175

Glu Gln Ile His Val Ser Met Phe Pro Gly Ser Leu Val Gly Glu Ile
180 185 190

Phe Ala Glu Gln Ile Glu Ala Thr Ile Arg His His Ala Leu Glu Ser
195 200 205

Gly Cys Phe Val Val Asn Ala Thr Gly Trp Leu Thr Pro Glu Gln Gln
210 215 220

Ala Gln Ile Val Lys Asp Thr Gly Gly Pro Ile Ala Ala Ile Ser Gly
225 230 235 240

Gly Cys Phe Thr Ala Ile Val Ser Pro Glu Gly Lys Leu Leu Gly Thr
245 250 255

Pro Leu Arg Ser Asp Ser Gly Glu Gly Ala Cys Ile Ala Glu Leu Asp
260 265 270

Phe Asn Leu Ile Asn Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285

Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Lys Thr Pro Thr
290 295 300

Ser His Thr His Pro Leu Lys Lys Pro Leu Ala Pro Ser Glu Lys Asn
305 310 315 320

Thr Pro Glu Asp Ile Ala Thr Gly Leu Thr Leu Val Thr Pro Val Ser
325 330 335

Asn Ala Asn Leu Phe Ser Ala Ser Asn
340 345

















<210> 51 
<211> 897 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 51 

gtgaacgtcc gcgtcgcggt ggtgcaggcc acgccggccg tgctcgacgg gccggcgtcg 60

gtgcggaagg cctgccgcct gatcggcgag gccgcggccg gcggcgcccg cctgatcgcc 120

ctgcccgagg gcttcgtgcc catcatgccg cgctcctgct gggggcacca cttcgcgctg 180

atcgcctcgc cgaagtcggc ggccctgcac cggcgcatct gggagaacgc cgtcgacgtc 240

ggcggcccgc tggcccgcga gctcggcgac gccgcgcgcc gcgcggacgc ctgggtggcc 300

atcggggtga acgagcgcga cgcccgccgg ccgggcacgc tctggaacac gctgctctgg 360

ttcgcgcccg acgggagcct ggcccggcgc caccgcaagc tcgtgcccac catgcacgag 420

cgcacgttct gggggcaggg cgcgggcgac gacctcgagg cgctggccgc ggacttcggc 480

cgcctgggcg gcctgatctg ctgggagaac ttcatgcccg ccgcgcgccg gcgcctgcac 540

cgggacgggg tcgacttcta cctggccccc acggcggacg accgggacat ctgggtcgcc 600

gcgatgcgca cgttcgcctt cgaggccggc gccttcgtcc tctcgccggt gcagtacctg 660

cggaccgccg acttcccgga ggacttcccg ctgcgcgagg agctcgccga ctgccccgag 720

gtccagttca ccggggggag cgtgatctgc gacccgtggg gcaacctcct ggcggggccg 780

gtccacgggg gcgaggagat cctctacgcc gactgcgatc tcgacctcgt cctcgaggcc 840

cgacgggtgc tcgacacggc cggccactac gaccgcccgg acctcgcctc ggcctga 897



<210> 52 
<211> 298 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 52 

Val Asn Val Arg Val Ala Val Val Gln Ala Thr Pro Ala Val Leu Asp
1 5 10 15

Gly Pro Ala Ser Val Arg Lys Ala Cys Arg Leu Ile Gly Glu Ala Ala
20 25 30

Ala Gly Gly Ala Arg Leu Ile Ala Leu Pro Glu Gly Phe Val Pro Ile
35 40 45

Met Pro Arg Ser Cys Trp Gly His His Phe Ala Leu Ile Ala Ser Pro
50 55 60

Lys Ser Ala Ala Leu His Arg Arg Ile Trp Glu Asn Ala Val Asp Val
65 70 75 80

Gly Gly Pro Leu Ala Arg Glu Leu Gly Asp Ala Ala Arg Arg Ala Asp
85 90 95

Ala Trp Val Ala Ile Gly Val Asn Glu Arg Asp Ala Arg Arg Pro Gly
100 105 110

Thr Leu Trp Asn Thr Leu Leu Trp Phe Ala Pro Asp Gly Ser Leu Ala
115 120 125

Arg Arg His Arg Lys Leu Val Pro Thr Met His Glu Arg Thr Phe Trp
130 135 140

Gly Gln Gly Ala Gly Asp Asp Leu Glu Ala Leu Ala Ala Asp Phe Gly
145 150 155 160

Arg Leu Gly Gly Leu Ile Cys Trp Glu Asn Phe Met Pro Ala Ala Arg
165 170 175

Arg Arg Leu His Arg Asp Gly Val Asp Phe Tyr Leu Ala Pro Thr Ala
180 185 190

Asp Asp Arg Asp Ile Trp Val Ala Ala Met Arg Thr Phe Ala Phe Glu
195 200 205

Ala Gly Ala Phe Val Leu Ser Pro Val Gln Tyr Leu Arg Thr Ala Asp
210 215 220

Phe Pro Glu Asp Phe Pro Leu Arg Glu Glu Leu Ala Asp Cys Pro Glu
225 230 235 240

Val Gln Phe Thr Gly Gly Ser Val Ile Cys Asp Pro Trp Gly Asn Leu
245 250 255

Leu Ala Gly Pro Val His Gly Gly Glu Glu Ile Leu Tyr Ala Asp Cys
260 265 270

Asp Leu Asp Leu Val Leu Glu Ala Arg Arg Val Leu Asp Thr Ala Gly
275 280 285

His Tyr Asp Arg Pro Asp Leu Ala Ser Ala
290 295

<210> 53 
<211> 954 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample
<400> 53

aJg^caatt caaagagtca gattaaaatc gcggcggtgc aggctgcccc ggtctttctg 60 

gaacgggagg cgacgattga caaagcttgc cggttgattg cggaggcagg ^gagcagggc 120 

gcgaatctgg tggtcttccc tgagtcattc gtcccggctt atcccgattg ggtctgggcc 180 

gt?ccggcag g?gaaacaac gctcctgaac acgctctatg ccgaactgct ggccaatgcc 240 

gt£gaa!t?c cgggtccggc gacagagcgg ctgagccagg cagccaacct ^cggggtt 300 

Ltgtcgcga tiggcttgac cgagcggaac atcgaggcca gtggggcgag ^tgtacaat 360 

act?tgctct ttScgactc agccggcggc atgttaggca agcatcgcaa ^gatcccc 420 

accggcggcg agcgcctggt ctgggctcag ggtgatggca gcactctggc ggtgtacgag 480

actlgg??tg gLLatggg agggttgatt tgctgggaga attacatgcc -^ggcccgt 540 

tatgccttgt atgcctgggg gacgcagatt tacatcgcgg ^acctggga tcgaggcgag 600 

ccgtggctgt caacgctgcg gcatatcgcc gcggaaggcc gggttgttgt cgtcggctgt 660

ggca?ggccc tgcgcaaagc cgacctgccc gaccgctttg aactcaagca ^ttttac 720 

cagaacgccg a^glgtggat caatgtcggc gacagcgcga ttgttaaccc tgatggtgaa 780 
ttcatcgccg ggccgctgcg cgagcaggaa ggcatcctct atgctgagat tgatctggcc 840
cagatgcgcg gccccaaatg gatgctcgac gtggccggcc attacgctcg cccggatgtg 900
tttgaactca tcgttcatcg ggaggcgcgg cccatgattg cgctaatttc atga 954 

<210> 54 
<211> 317 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 54 

Met Val Asn Ser Lys Ser Gln Ile Lys Ile Ala Ala Val Gln Ala Ala
1 5 10 15

Pro Val Phe Leu Glu Arg Glu Ala Thr Ile Asp Lys Ala Cys Arg Leu
20 25 30

Ile Ala Glu Ala Gly Glu Gln Gly Ala Asn Leu Val Val Phe Pro Glu
35 40 45

Ser Phe Val Pro Ala Tyr Pro Asp Trp Val Trp Ala Val Pro Ala Gly
50 55 60

Glu Thr Thr Leu Leu Asn Thr Leu Tyr Ala Glu Leu Leu Ala Asn Ala
65 70 75 80

Val Glu Ile Pro Gly Pro Ala Thr Glu Arg Leu Ser Gln Ala Ala Asn
85 90 95

Leu Ala Gly Val Tyr Val Ala Ile Gly Leu Thr Glu Arg Asn Ile Glu
100 105 110

Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Phe Leu Asp Ser Ala
115 120 125

Gly Gly Met Leu Gly Lys His Arg Lys Leu Ile Pro Thr Gly Gly Glu
130 135 140

Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Ala Val Tyr Glu
145 150 155 160

Thr Arg Phe Gly Lys Met Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met
165 170 175

Pro Leu Ala Arg Tyr Ala Leu Tyr Ala Trp Gly Thr Gln Ile Tyr Ile
180 185 190

Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His
195 200 205

Ile Ala Ala Glu Gly Arg Val Val Val Val Gly Cys Gly Met Ala Leu
210 215 220

Arg Lys Ala Asp Leu Pro Asp Arg Phe Glu Leu Lys Gln Arg Phe Tyr
225 230 235 240

Gln Asn Ala Asp Glu Trp Ile Asn Val Gly Asp Ser Ala Ile Val Asn
245 250 255

Pro Asp Gly Glu Phe Ile Ala Gly Pro Leu Arg Glu Gln Glu Gly Ile
260 265 270

Leu Tyr Ala Glu Ile Asp Leu Ala Gln Met Arg Gly Pro Lys Trp Met
275 280 285

Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu Leu Ile
290 295 300

Val His Arg Glu Ala Arg Pro Met Ile Ala Leu Ile Ser
305 310 315

<210> 55 
<211> 1017 
<212> DNA 
<213> Unknown 

<220>

<223> Obtained from an environmental sample 
<400> 55

atgggtatcg aacatccgaa gtacaaggtc gccgtggtgc aggcagctcc cgcctggctc 60

gatctcgacg cgtcgatcga caagtcgatc gcgctgatcg aggaggcggc ccaaaaaggc 120

gccaagctga tcgcattccc cgaggccttc attcccggct atccctggca catctggatg 180

gactcgccgg cctgggcgat cggccgcggc tttgtgcagc gctattttga caattcgctc 240

gcctatgaca gcccgcaggc cgagaagctg cgcgcggcgg tgcgcaaggc aaagctcacc 300

gccgtgctcg ggctgtccga gcgcgacggc ggcagtctct atctggcgca atggttgatc 360

gggcccgatg gcgagaccat cgcaaaacgc cgcaagctgc ggccgacaca tgccgagcgc 420

acggtgtacg gcgagggcga cggcagcgat ctcgcagtcc acaaccgtcc cgatatcggc 480

cgcctcggcg cgctctgctg ctgggagcat ttgcagccac tgtcgaaata cgcgatgtac 540

gcgcagaacg agcaggtgca tgtcgcggcc tggccgagct tttcgctcta cgatcccttt 600

gcggtggcgc tcggcgccga ggtgaacaac gcggcctcgc gcgtctatgc agtcgaaggc 660

tcctgcttcg tgctggcgcc atgcgccacc gtctcgcagg ccatgatcga cgagctctgc 720

gaccgaccgg acaagcatac gctgctgcat gtcggcggcg gttttgccgc gatctatggt 780

cctgacggca gccagatcgg cgacaagctc gcgcccgacc aggaagggct gttgatcgcg 840

gagatcgacc ttggggccat tggcgtcgcc aagaacgcgg ccgatcccgc cgggcattat 900

tcgcggcccg acgtgacgcg gctcctgctc aacaagaaac cgtacaagcg cgtcgagcag 960

ttctcgccac cggccgaggc ggtcgagccc acagatatcg cagcggcggc aagctga 1017



<210> 56 
<211> 338 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 56 



Met Gly Ile Glu His Pro Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15

Pro Ala Trp Leu Asp Leu Asp Ala Ser Ile Asp Lys Ser Ile Ala Leu
20 25 30

Ile Glu Glu Ala Ala Gln Lys Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45

Ala Phe Ile Pro Gly Tyr Pro Trp His Ile Trp Met Asp Ser Pro Ala
50 55 60

Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80

Ala Tyr Asp Ser Pro Gln Ala Glu Lys Leu Arg Ala Ala Val Arg Lys
85 90 95

Ala Lys Leu Thr Ala Val Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110

Leu Tyr Leu Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125

Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly
130 135 140

Glu Gly Asp Gly Ser Asp Leu Ala Val His Asn Arg Pro Asp Ile Gly
145 150 155 160

Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175

Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
180 185 190

Ser Phe Ser Leu Tyr Asp Pro Phe Ala Val Ala Leu Gly Ala Glu Val
195 200 205

Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220

Leu Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225 230 235 240

Asp Arg Pro Asp Lys His Thr Leu Leu His Val Gly Gly Gly Phe Ala
245 250 255

Ala Ile Tyr Gly Pro Asp Gly Ser Gln Ile Gly Asp Lys Leu Ala Pro
260 265 270

Asp Gln Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Ala Ile Gly
275 280 285

Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300

Val Thr Arg Leu Leu Leu Asn Lys Lys Pro Tyr Lys Arg Val Glu Gln
305 310 315 320

Phe Ser Pro Pro Ala Glu Ala Val Glu Pro Thr Asp Ile Ala Ala Ala
325 330 335

Ala Ser





<210> 57 
<211> 1014 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 57 

gtgaaagaag caatcaaagt agcctgtgtg caagcagctc cagtctttct cgacctggac 60

gccacagtgg acaagaccgt cgccctgatt gaggaggcag cccgtaacgg cgcacgccta 120

atcgcctttc cagagacctg gattccaggc tacccatggt tcctttggct ggactcacca 180

gcctggggga tgcaattcgt gcgccgatac cacgagaact cactggtcct cgacagccct 240

caggccaagc gcatcagtga ggccgcccag cgcgccggta tatacgtcgc gctagggtac 300

agcgaacgcg tgagcggaac cctctacatg gggcagtggc tcattgacga taagggcgaa 360

acagctgggc tgcgccgaaa gctgaaacca acccatgtag agcgaaccct cttcggtgaa 420

ggcgacggat catccctttc cactttcgac acaccgttgg gggtgctggg cggactctgc 480

tgttgggaac acttacaacc tctttcgaaa tatgcgctct acgcacagaa cgaggaaata 540

cacttcgccg cctggcctag cttcagcatc taccgtcaag cgacagaagt ccttggacca 600

gaagtaaatg tcgcagcttc tcggatctac gccgtggaag ggcagtgttt tgttctcgct 660

tcctgcgcgc tcgtctcgcc agagatgatc gaaatgctct gcactgacga aagcaagcac 720

agccttcttc aggccggcgg cgggtactcc cgcattatcg gtcccgatgg cagcgaccta 780

gcgcgcccct tgggcgaaaa cgaggaaggt attctctatg ccactctgga ccctgccgct 840

cgaatctatg caaagaccgc agctgatcca gccgggcact actccagacc agacgtcact 900

cggctgctga tcaatcgcag tgccaatcag ccagtcgtag aggttggacg ggaaatacct 960

gcatcggccc aaggctttga agttgaggcg gcccccgggt acggaggcga ttga 1014



<210> 58 
<211> 337 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample
<400> 58

Val Lys Glu Ala Ile Lys Val Ala Cys Val Gln Ala Ala Pro Val Phe
1 5 10 15

Leu Asp Leu Asp Ala Thr Val Asp Lys Thr Val Ala Leu Ile Glu Glu
20 25 30

Ala Ala Arg Asn Gly Ala Arg Leu Ile Ala Phe Pro Glu Thr Trp Ile
35 40 45

Pro Gly Tyr Pro Trp Phe Leu Trp Leu Asp Ser Pro Ala Trp Gly Met
50 55 60

Gln Phe Val Arg Arg Tyr His Glu Asn Ser Leu Val Leu Asp Ser Pro
65 70 75 80

Gln Ala Lys Arg Ile Ser Glu Ala Ala Gln Arg Ala Gly Ile Tyr Val
85 90 95

Ala Leu Gly Tyr Ser Glu Arg Val Ser Gly Thr Leu Tyr Met Gly Gln
100 105 110

Trp Leu Ile Asp Asp Lys Gly Glu Thr Ala Gly Leu Arg Arg Lys Leu
115 120 125

Lys Pro Thr His Val Glu Arg Thr Leu Phe Gly Glu Gly Asp Gly Ser
130 135 140

Ser Leu Ser Thr Phe Asp Thr Pro Leu Gly Val Leu Gly Gly Leu Cys
145 150 155 160

Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr Ala Leu Tyr Ala Gln
165 170 175

Asn Glu Glu Ile His Phe Ala Ala Trp Pro Ser Phe Ser Ile Tyr Arg
180 185 190

Gln Ala Thr Glu Val Leu Gly Pro Glu Val Asn Val Ala Ala Ser Arg
195 200 205

Ile Tyr Ala Val Glu Gly Gln Cys Phe Val Leu Ala Ser Cys Ala Leu
210 215 220

Val Ser Pro Glu Met Ile Glu Met Leu Cys Thr Asp Glu Ser Lys His
225 230 235 240

Ser Leu Leu Gln Ala Gly Gly Gly Tyr Ser Arg Ile Ile Gly Pro Asp
245 250 255

Gly Ser Asp Leu Ala Arg Pro Leu Gly Glu Asn Glu Glu Gly Ile Leu
260 265 270

Tyr Ala Thr Leu Asp Pro Ala Ala Arg Ile Tyr Ala Lys Thr Ala Ala
275 280 285

Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val Thr Arg Leu Leu Ile
290 295 300

Asn Arg Ser Ala Asn Gln Pro Val Val Glu Val Gly Arg Glu Ile Pro
305 310 315 320

Ala Ser Ala Gln Gly Phe Glu Val Glu Ala Ala Pro Gly Tyr Gly Gly
325 330 335

Asp

































<210> 59 
<211> 987 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 59 

atgcgagata ggaatttcaa actggcggcc attcaggcgg agccggtttt ctttaatcgc 60

cgggcctcga cggaaaaggc ctgcagattg atcaaagaag cgggcgcgat gggcgccgat 120

atcgcgggat tcagcgagac ctggcttccc gggtatccct tttttatctg gggcgcaagc 180

gccgatccat ccctgctctg gaaggcttct gcggaatacc tggccaatgc cgttcaaata 240

cccggtcccg agacggatca attatgcgag gcggcgaaaa aggccggcat cgatgtggcg 300

atcggagtgg ttgaactcga cgagtttacg aagggaacgg cttactgcac gctgctcttc 360

atcggcaaag aagggaagat cctgggaaag caccgcaaac tcaagccgac gcaccgggag 420

cgcacggtat ggggagaggg cgatgcgacg ggactcagtg tccatgagcg tccttacggg 480

cggatcagcg gcctgaactg ctgggagcat aatatggtcc tgcccggcta tgtcctgatg 540

tctcagggca cgcacattca tatcgcggcc tggccgggtt cggaagggaa agcacctccc 600

gcgccgtctc cgatgtggga gcgccagctt ctgctctccc gcgctttcgc ttcgcaatcc 660

gccgcatacg tgattctggt cggaggactc ctgaacccgc agaatattcc ggcgccctac 720

gatgaacttg ccgtcaagta ccggggagac agtttcatca tcgatccgcg cggggagatc 780

atcgccgggc cggccaaggg ggaaaccatt ctcatcgccg aaggctcgat ggaacaggtc 840

ctcgcggcaa agtccgcctt cgatgtcgcg ggacattatt cccgccccga cgtctttcaa 900

ctctgcgtca accgcaaacc gtaccggcgt gtaagggaaa cttcggagca ggaccaaccc 960

gcttctgaaa gagaatcgga atcgtaa 987

<210> 60
<211> 328
<212> PRT
<213> Unknown 



<220> 

<223> Obtained from an environmental sample



<400> 60 




























Met Arg Asp Arg Asn Phe Lys Leu Ala Ala Ile Gln Ala Glu Pro Val
1 5 10 15

Phe Phe Asn Arg Arg Ala Ser Thr Glu Lys Ala Cys Arg Leu Ile Lys
20 25 30

Glu Ala Gly Ala Met Gly Ala Asp Ile Ala Gly Phe Ser Glu Thr Trp
35 40 45

Leu Pro Gly Tyr Pro Phe Phe Ile Trp Gly Ala Ser Ala Asp Pro Ser
50 55 60

Leu Leu Trp Lys Ala Ser Ala Glu Tyr Leu Ala Asn Ala Val Gln Ile
65 70 75 80

Pro Gly Pro Glu Thr Asp Gln Leu Cys Glu Ala Ala Lys Lys Ala Gly
85 90 95

Ile Asp Val Ala Ile Gly Val Val Glu Leu Asp Glu Phe Thr Lys Gly
100 105 110

Thr Ala Tyr Cys Thr Leu Leu Phe Ile Gly Lys Glu Gly Lys Ile Leu
115 120 125

Gly Lys His Arg Lys Leu Lys Pro Thr His Arg Glu Arg Thr Val Trp
130 135 140

Gly Glu Gly Asp Ala Thr Gly Leu Ser Val His Glu Arg Pro Tyr Gly
145 150 155 160

Arg Ile Ser Gly Leu Asn Cys Trp Glu His Asn Met Val Leu Pro Gly
165 170 175

Tyr Val Leu Met Ser Gln Gly Thr His Ile His Ile Ala Ala Trp Pro
180 185 190

Gly Ser Glu Gly Lys Ala Pro Pro Ala Pro Ser Pro Met Trp Glu Arg
195 200 205

Gln Leu Leu Leu Ser Arg Ala Phe Ala Ser Gln Ser Ala Ala Tyr Val
210 215 220

Ile Leu Val Gly Gly Leu Leu Asn Pro Gln Asn Ile Pro Ala Pro Tyr
225 230 235 240

Asp Glu Leu Ala Val Lys Tyr Arg Gly Asp Ser Phe Ile Ile Asp Pro
245 250 255

Arg Gly Glu Ile Ile Ala Gly Pro Ala Lys Gly Glu Thr Ile Leu Ile
260 265 270

Ala Glu Gly Ser Met Glu Gln Val Leu Ala Ala Lys Ser Ala Phe Asp
275 280 285

Val Ala Gly His Tyr Ser Arg Pro Asp Val Phe Gln Leu Cys Val Asn
290 295 300

Arg Lys Pro Tyr Arg Arg Val Arg Glu Thr Ser Glu Gln Asp Gln Pro
305 310 315 320

Ala Ser Glu Arg Glu Ser Glu Ser
325



<210> 61 
<211> 966 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 61 

atgactcgat cttacccgaa tgacacactc acggttgggc ttgcgcaaat tgctccagtc  60
tggttggatc gtacagggac aatttcaaag atattagctc aagtccatgc ggcaaatgaa  120
gcgggctgtc atcttgtcgc gtttggcgaa ggtctcctcc ccggatatcc gttttggatt  180
gagcgaacaa atggtgcagt cttcaactcg cccacgcaga aagaaattca cgcgcattat  240
ctggatcagg ctgtccagat cgaagcaggt catcttgagg cgctttgcga agcagccaag  300
gaatatgaga tcgcaattgt cctgggatgc attgaacgtc cgcaagatcg tggagggcac  360
agtctgtatg caagccttgt atatattgat tcagacggca tcatccaatc tgtgcatcga  420
aagttaatgc caacatatga agaacggctc acctggtcgc caggtgacgg acatggatta  480
cgggtgcaca aattaggtgc ctttacggtt ggcggcctca actgttggga aaactggatg  540
cctttggcac gcgcggccat gtatggtcaa ggcgaggatt tgcatattgc catttggccc  600
ggcggctccc acaatacgca agacattaca cgctttattg cactagaatc gcgttcctat  660
gttttatctg tgtcaggttt aatgcgctca ggcgattttc caaaagagac cccacatctt  720
gcatccatcc tggctaaagg tgaggatatt cttgccaacg gtggttcatg tatcgccggt  780
cctgacggca aatggatcgt tgagccgctt gtaggagaag agaagttaat tgttgcaacg  840
attgatcatt gtcgtgtgcg cgaagagcgt caaaattttg atccttcagg acattacagc  900
aggccagatg tattgcaact gaaaataaac aggcaacgcc agagtacaat ctcgtttgga  960
gagtaa  966

<210> 62
<211> 321
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 62

Met Thr Arg Ser Tyr Pro Asn Asp Thr Leu Thr Val Gly Leu Ala Gln
1 5 10 15

Ile Ala Pro Val Trp Leu Asp Arg Thr Gly Thr Ile Ser Lys Ile Leu
20 25 30

Ala Gln Val His Ala Ala Asn Glu Ala Gly Cys His Leu Val Ala Phe
35 40 45

Gly Glu Gly Leu Leu Pro Gly Tyr Pro Phe Trp Ile Glu Arg Thr Asn
50 55 60

Gly Ala Val Phe Asn Ser Pro Thr Gln Lys Glu Ile His Ala His Tyr
65 70 75 80

Leu Asp Gln Ala Val Gln Ile Glu Ala Gly His Leu Glu Ala Leu Cys
85 90 95

Glu Ala Ala Lys Glu Tyr Glu Ile Ala Ile Val Leu Gly Cys Ile Glu
100 105 110

Arg Pro Gln Asp Arg Gly Gly His Ser Leu Tyr Ala Ser Leu Val Tyr
115 120 125

Ile Asp Ser Asp Gly Ile Ile Gln Ser Val His Arg Lys Leu Met Pro
130 135 140

Thr Tyr Glu Glu Arg Leu Thr Trp Ser Pro Gly Asp Gly His Gly Leu
145 150 155 160

Arg Val His Lys Leu Gly Ala Phe Thr Val Gly Gly Leu Asn Cys Trp
165 170 175

Glu Asn Trp Met Pro Leu Ala Arg Ala Ala Met Tyr Gly Gln Gly Glu
180 185 190

Asp Leu His Ile Ala Ile Trp Pro Gly Gly Ser His Asn Thr Gln Asp
195 200 205

Ile Thr Arg Phe Ile Ala Leu Glu Ser Arg Ser Tyr Val Leu Ser Val
210 215 220

Ser Gly Leu Met Arg Ser Gly Asp Phe Pro Lys Glu Thr Pro His Leu
225 230 235 240

Ala Ser Ile Leu Ala Lys Gly Glu Asp Ile Leu Ala Asn Gly Gly Ser
245 250 255

Cys Ile Ala Gly Pro Asp Gly Lys Trp Ile Val Glu Pro Leu Val Gly
260 265 270

Glu Glu Lys Leu Ile Val Ala Thr Ile Asp His Cys Arg Val Arg Glu
275 280 285

Glu Arg Gln Asn Phe Asp Pro Ser Gly His Tyr Ser Arg Pro Asp Val
290 295 300

Leu Gln Leu Lys Ile Asn Arg Gln Arg Gln Ser Thr Ile Ser Phe Gly
305 310 315 320

Glu



<210> 63 
<211> 978 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 63 

atgcaagata gagtaccgat tgtacgagct gcggctatcc aggctgaacc catagtcctt  60
gattgtgacg cgaccgtgga aaaagcctgc cgattgatcg gtgaagcagc agaaaatggt  120
gcaaacctga tcgtgtttcc cgaagccttc attcccgttt atcccaatgc ggcgatctgg  180
ggtcgaggtc tggccacttt tggcggacag cgccagaaat acgtatggac gcgactatgg  240
aacaattcgg tggaaatccc tggtccggcc accgacaggc tggcaaaggc agcacacgag  300
gctcgagcca ccgttgtcat gggattgaat gagcgcgcgg tcgataacaa cacgctttac  360
aacaccctgc tatttattgg gccagacggt cgcttgctgg gcaagcaccg taagctcatg  420
cccaccaatc acgaacggat gatctggggt atgggagatg ggagcaccct gcgggttttt  480
gatacaccct gtggaaaagt aggcggtctc atctgctggg aaaactacat gcctctggcg  540
cgttatgcac tctatggaca gggcgaacaa atccatgtcg cgccgactgc gcacgatggt  600
gagatcactc tggtcaatgc acgcaatacc gcctatgagg gacgcttatt cgtcatctcc  660
gtgtgcatga tccttcgcaa gtccagcttt ccccatgatt ttgagctggg cgaggaattg  720
gcggaggcag atgacttcat aaaatcaggc ggcagcgcga tcgttgggcc agatggcgag  780
gtgctggcgg gtccattgtg gaatgaagag aatatactgt atgccgatct tgacttgaat  840
cgaattgtgg atgagagacg agtatttgat gtgacgggcc attattcacg tccagatgtt  900
ctacgactgc actttaatgc ttcccctcag aaaactattg aaagatatga gcaacctctc  960
gatccgtctg agggttaa  978



<210> 64 
<211> 325 
<212> PRT 
<213> Unknown 

<220>

<223> Obtained from an environmental sample 



<400> 64 



Met Gln Asp Arg Val Pro Ile Val Arg Ala Ala Ala Ile Gln Ala Glu
1 5 10 15

Pro Ile Val Leu Asp Cys Asp Ala Thr Val Glu Lys Ala Cys Arg Leu
20 25 30

Ile Gly Glu Ala Ala Glu Asn Gly Ala Asn Leu Ile Val Phe Pro Glu
35 40 45

Ala Phe Ile Pro Val Tyr Pro Asn Ala Ala Ile Trp Gly Arg Gly Leu
50 55 60

Ala Thr Phe Gly Gly Gln Arg Gln Lys Tyr Val Trp Thr Arg Leu Trp
65 70 75 80

Asn Asn Ser Val Glu Ile Pro Gly Pro Ala Thr Asp Arg Leu Ala Lys
85 90 95

Ala Ala His Glu Ala Arg Ala Thr Val Val Met Gly Leu Asn Glu Arg
100 105 110

Ala Val Asp Asn Asn Thr Leu Tyr Asn Thr Leu Leu Phe Ile Gly Pro
115 120 125

Asp Gly Arg Leu Leu Gly Lys His Arg Lys Leu Met Pro Thr Asn His
130 135 140

Glu Arg Met Ile Trp Gly Met Gly Asp Gly Ser Thr Leu Arg Val Phe
145 150 155 160

Asp Thr Pro Cys Gly Lys Val Gly Gly Leu Ile Cys Trp Glu Asn Tyr
165 170 175

Met Pro Leu Ala Arg Tyr Ala Leu Tyr Gly Gln Gly Glu Gln Ile His
180 185 190

Val Ala Pro Thr Ala His Asp Gly Glu Ile Thr Leu Val Asn Ala Arg
195 200 205

Asn Thr Ala Tyr Glu Gly Arg Leu Phe Val Ile Ser Val Cys Met Ile
210 215 220

Leu Arg Lys Ser Ser Phe Pro His Asp Phe Glu Leu Gly Glu Glu Leu
225 230 235 240

Ala Glu Ala Asp Asp Phe Ile Lys Ser Gly Gly Ser Ala Ile Val Gly
245 250 255

Pro Asp Gly Glu Val Leu Ala Gly Pro Leu Trp Asn Glu Glu Asn Ile
260 265 270

Leu Tyr Ala Asp Leu Asp Leu Asn Arg Ile Val Asp Glu Arg Arg Val
275 280 285

Phe Asp Val Thr Gly His Tyr Ser Arg Pro Asp Val Leu Arg Leu His
290 295 300

Phe Asn Ala Ser Pro Gln Lys Thr Ile Glu Arg Tyr Glu Gln Pro Leu
305 310 315 320

Asp Pro Ser Glu Gly
325



<210> 65 
<211> 1002 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 65 

atgccgaccc ccacttcaaa attcaaaatc ggcgccgtgc aggcatcgcc ggtttttctg  60
gaccgggaag ccactgcgca aaaagcctgc aaattgattg ccgaagcggg agggcagggc  120
gcgcggctga tcgttttccc ggagtctttc attcccacct atcctgattg ggtctgggcg  180
gtcccgccgg gaagaggaaa agtgttaagc gaactttacg ccgagctgct ggccaatgcc  240
gtggaagtcc ccgggccggt caccgatcag ctgggtgaag cagcccaaaa aacgggcgcc  300
tatgtcgtca tgggcgtcac ggaaaaggac accgacgcaa gcggcgcgag cctttacaac  360
acgctcctct atttcaaccc cgcgggggac ctcctgggaa aacaccggaa gcttgttcct  420
accggcgggg agcggctggt ctgggcgcag ggcgacggca gcaccctgga agtgtacgac  480
actcccctgg gaaaaatcgg aggcctcatc tgctgggaaa actacatgcc cctcgcccgg  540
tacacgatgt atgcctgggg gacccagatt tatatcgcgg ccacatggga ccagggggag  600
acgtggcttg ccaccctgcg gcatatcgct aaggaaggac gggtgtacgt catcggctgc  660
tgcatcgcgc tgcggcggga cgacatcccc gaccggctgg aatacaagaa gaagttctac  720
tcggggtcgc gggaatggat caatatgggg gacagcgcca tcgtgaaccc ggaaggcgaa  780
ttcattgccg gccccgtgcg gatgaaggag gagatcctgt atgccgaggt ggaccccctc  840
ctgatggcgg gatcgaaatg gatgctcgac gtcgcggggc attacgcgcg ccccgacgtc  900
tttgaactca tcgtccaccg ccagccccac ccgatgatcc gggtaatcga gaaagaggga  960
ggggccggaa gaaccgggga cgagaagaag gaaaatgagt ga  1002



<210> 66 
<211> 333 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 66 
Met Pro Thr Pro Thr Ser Lys Phe Lys Ile Gly Ala Val Gln Ala Ser
1 5 10 15

Pro Val Phe Leu Asp Arg Glu Ala Thr Ala Gln Lys Ala Cys Lys Leu
20 25 30

Ile Ala Glu Ala Gly Gly Gln Gly Ala Arg Leu Ile Val Phe Pro Glu
35 40 45

Ser Phe Ile Pro Thr Tyr Pro Asp Trp Val Trp Ala Val Pro Pro Gly
50 55 60

Arg Gly Lys Val Leu Ser Glu Leu Tyr Ala Glu Leu Leu Ala Asn Ala
65 70 75 80

Val Glu Val Pro Gly Pro Val Thr Asp Gln Leu Gly Glu Ala Ala Gln
85 90 95

Lys Thr Gly Ala Tyr Val Val Met Gly Val Thr Glu Lys Asp Thr Asp
100 105 110

Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Phe Asn Pro Ala
115 120 125

Gly Asp Leu Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu
130 135 140

Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Glu Val Tyr Asp
145 150 155 160

Thr Pro Leu Gly Lys Ile Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met
165 170 175

Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gln Ile Tyr Ile
180 185 190

Ala Ala Thr Trp Asp Gln Gly Glu Thr Trp Leu Ala Thr Leu Arg His
195 200 205

Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile Ala Leu
210 215 220

Arg Arg Asp Asp Ile Pro Asp Arg Leu Glu Tyr Lys Lys Lys Phe Tyr
225 230 235 240

Ser Gly Ser Arg Glu Trp Ile Asn Met Gly Asp Ser Ala Ile Val Asn
245 250 255

Pro Glu Gly Glu Phe Ile Ala Gly Pro Val Arg Met Lys Glu Glu Ile
260 265 270

Leu Tyr Ala Glu Val Asp Pro Leu Leu Met Ala Gly Ser Lys Trp Met
275 280 285

Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu Leu Ile
290 295 300

Val His Arg Gln Pro His Pro Met Ile Arg Val Ile Glu Lys Glu Gly
305 310 315 320

Gly Ala Gly Arg Thr Gly Asp Glu Lys Lys Glu Asn Glu
325 330



<210> 67 
<211> 936 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 67 

atgccccgtg tggcggtggt ccagcgcccg ccggtgtttc tcgaccgcgc cgcgaccctc  60
gagaacgccg tggcttcgct cgccgaggcc gcgtcgaacg gggctcgcct cgcggtcttt  120
ccggaagccc tggttcccgg ctatccggcg tggatgtggc ggctgcggcc cgggcccgac  180
atggcgctca ccgagcggat tcacgcgcgc ttgcgggcga actcggtgag cctcgccgcc  240
gacgagctcg cgccgctgcg cgaggcggcc cggcgccacg agctcaccgt agtgtgcggc  300
ctgcacgagc gcgacgaggc gctcggcggc ggcacgctct ataacaccgt cgtcacgatc  360
ggcgccgacg gcgcggtgct caaccgccac cggaagctga tgcccaccaa ccccgagcgc  420
atggtctggg gctgcggcga tgccagcggg ctcaggacgg tccccaccca gtgcgggcgc  480
gtcggcgccc tgatctgctg ggaaagctac atgccgcttg cacgctacgc gctgtacgcc  540
cagggaatcg acctctacgt cacgccgacc tacgacagcg gcgagcgggc ggttgcgacc  600
atgcagcaca ttgcccgcga aggcggctgc tgggtggtga gctgcggctc ggcgtttcag  660
gcgcgcgacg tcccggacgc gtttccgggg aagagcgagc ttttccgcga caacgacgag  720
tggatcaacc cgggcgactc ggtcgtggtc gcgccgggcg gcaaggtcgt cgccgggccg  780
ctgcacaaag aacgcgcgat cctgtacgcc gagatcgacc tcgagcgggt cggcgtggcg  840
cgccgcagcc tggacgtggt cggccattat gcgcggcccg acctcttcga cctgcacgtg  900
aacgcccgcc cgcaaagcgt ggttgaattg cgctga  936



<210> 68 
<211> 311 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 68 
Met Pro Arg Val Ala Val Val Gln Arg Pro Pro Val Phe Leu Asp Arg
1 5 10 15

Ala Ala Thr Leu Glu Asn Ala Val Ala Ser Leu Ala Glu Ala Ala Ser
20 25 30

Asn Gly Ala Arg Leu Ala Val Phe Pro Glu Ala Leu Val Pro Gly Tyr
35 40 45

Pro Ala Trp Met Trp Arg Leu Arg Pro Gly Pro Asp Met Ala Leu Thr
50 55 60

Glu Arg Ile His Ala Arg Leu Arg Ala Asn Ser Val Ser Leu Ala Ala
65 70 75 80

Asp Glu Leu Ala Pro Leu Arg Glu Ala Ala Arg Arg His Glu Leu Thr
85 90 95

Val Val Cys Gly Leu His Glu Arg Asp Glu Ala Leu Gly Gly Gly Thr
100 105 110

Leu Tyr Asn Thr Val Val Thr Ile Gly Ala Asp Gly Ala Val Leu Asn
115 120 125

Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val Trp Gly
130 135 140

Cys Gly Asp Ala Ser Gly Leu Arg Thr Val Pro Thr Gln Cys Gly Arg
145 150 155 160

Val Gly Ala Leu Ile Cys Trp Glu Ser Tyr Met Pro Leu Ala Arg Tyr
165 170 175

Ala Leu Tyr Ala Gln Gly Ile Asp Leu Tyr Val Thr Pro Thr Tyr Asp
180 185 190

Ser Gly Glu Arg Ala Val Ala Thr Met Gln His Ile Ala Arg Glu Gly
195 200 205

Gly Cys Trp Val Val Ser Cys Gly Ser Ala Phe Gln Ala Arg Asp Val
210 215 220

Pro Asp Ala Phe Pro Gly Lys Ser Glu Leu Phe Arg Asp Asn Asp Glu
225 230 235 240

Trp Ile Asn Pro Gly Asp Ser Val Val Val Ala Pro Gly Gly Lys Val
245 250 255

Val Ala Gly Pro Leu His Lys Glu Arg Ala Ile Leu Tyr Ala Glu Ile
260 265 270

Asp Leu Glu Arg Val Gly Val Ala Arg Arg Ser Leu Asp Val Val Gly
275 280 285

His Tyr Ala Arg Pro Asp Leu Phe Asp Leu His Val Asn Ala Arg Pro
290 295 300

Gln Ser Val Val Glu Leu Arg
305 310



<210> 69 
<211> 939 
<212> DNA 
<213> Unknown
<220> 

<223> Obtained from an environmental sample 



<400> 69 

gtgaccgagt ttcggacggt gcgggtcgca gcggtgcagg cgacgccggt gaccctcgac  60
gccgatgcct cggtcgagaa ggcgatcggg ctgatcggcg aggcggtggc cggtggagcg  120
cagctcgtcg tgctgcccga ggccttcgtg tcgctctacc cgtcgaacgc gtgggcgcga  180
gcggccgccg gattcggcgg cttcgacgag ctctgggagc ggatgtgggc cagctcgctc  240
gacgtcccgg gcccgctggt cgaccggctg gtcgatgcgt gccgcaggca tgacgtggta  300
tgcgtgatcg gcgtgaacga gcgcgaaagc gaaaggccgg ggtcgcttta caacacgatg  360
ctgaccctcg gcccgtcggg cctcctgcac cggcaccgca agctcatgcc gacgcaccac  420
gagcggctgt tccatgggat cggcgacggt caagacctcg gcgttgtgga gaccgacgcg  480
ggacggatcg ggggactgat ctgctgggag aaccgaatgc cgctcgcgcg ctacgcggtc  540
taccagggtg gaccgcagat ctgggtcgcg ccgacggccg atgactccga cggctggctc  600
gcgagcatgc gccacatcgc gatcgagtcg ggcgcgttcg tcgtgtcggt gccgcagttc  660
atcccggcgt ccgcgttccc cgacgatttc cccgtcgagc taccgccggg caaggaggtg  720
ttcggccgcg gcggtgcggc gatcgtcgag ccgacctggg gcgaggtaat cgccgggccg  780
ctctacgatc gggaggggat cgtgttcgcc gactgtgacc tgcgacgcgg cttgcatgcc  840
aagcgctggt tcgactccgt cggccattac agccgcgcgg aggtgctcga tggcggcgtc  900
gagcgcgtcc cggcgccggt ggacggcgaa tcgccgtga  939



<210> 70 
<211> 312 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
<400> 70 

Val Thr Glu Phe Arg Thr Val Arg Val Ala Ala Val Gln Ala Thr Pro
1 5 10 15

Val Thr Leu Asp Ala Asp Ala Ser Val Glu Lys Ala Ile Gly Leu Ile
20 25 30

Gly Glu Ala Val Ala Gly Gly Ala Gln Leu Val Val Leu Pro Glu Ala
35 40 45

Phe Val Ser Leu Tyr Pro Ser Asn Ala Trp Ala Arg Ala Ala Ala Gly
50 55 60

Phe Gly Gly Phe Asp Glu Leu Trp Glu Arg Met Trp Ala Ser Ser Leu
65 70 75 80

Asp Val Pro Gly Pro Leu Val Asp Arg Leu Val Asp Ala Cys Arg Arg
85 90 95

His Asp Val Val Cys Val Ile Gly Val Asn Glu Arg Glu Ser Glu Arg
100 105 110

Pro Gly Ser Leu Tyr Asn Thr Met Leu Thr Leu Gly Pro Ser Gly Leu
115 120 125

Leu His Arg His Arg Lys Leu Met Pro Thr His His Glu Arg Leu Phe
130 135 140

His Gly Ile Gly Asp Gly Gln Asp Leu Gly Val Val Glu Thr Asp Ala
145 150 155 160

Gly Arg Ile Gly Gly Leu Ile Cys Trp Glu Asn Arg Met Pro Leu Ala
165 170 175

Arg Tyr Ala Val Tyr Gln Gly Gly Pro Gln Ile Trp Val Ala Pro Thr
180 185 190

Ala Asp Asp Ser Asp Gly Trp Leu Ala Ser Met Arg His Ile Ala Ile
195 200 205

Glu Ser Gly Ala Phe Val Val Ser Val Pro Gln Phe Ile Pro Ala Ser
210 215 220

Ala Phe Pro Asp Asp Phe Pro Val Glu Leu Pro Pro Gly Lys Glu Val
225 230 235 240

Phe Gly Arg Gly Gly Ala Ala Ile Val Glu Pro Thr Trp Gly Glu Val
245 250 255

Ile Ala Gly Pro Leu Tyr Asp Arg Glu Gly Ile Val Phe Ala Asp Cys
260 265 270

Asp Leu Arg Arg Gly Leu His Ala Lys Arg Trp Phe Asp Ser Val Gly
275 280 285

His Tyr Ser Arg Ala Glu Val Leu Asp Gly Gly Val Glu Arg Val Pro
290 295 300

Ala Pro Val Asp Gly Glu Ser Pro
305 310



<210> 71 
<211> 966 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 71 

atgccaaacg agaacaccaa cgccacattc aaagttgccg ctgtgcaggc ttcgcctgtg  60
tttcttgatc gtgccgcaac aatcgacaag gcttgcgatt tgatcgccgc tgctggcggt  120
gaaggggcac gcttgattgt ctttccagaa gcattcatcc cgtcttatcc tgattgggta  180
tgggcaattc cttcgggtga agagggcgta ctcaatgagt tgtacgcaga tctgctatcc  240
aactcggtca cgattcccag tgactcgacg gacaaactgt gcagagcagc caggcttgct  300
aatgcctacg tggtgatggg tatgagcgaa cgcaatgctg aggcaagcgg cgcgagcatg  360
tataacacgc tattgtatat tgatgcacag ggggagattc tgggcaagca tcggaagttg  420
gtgccaacgg gcggcgagcg gctagtctgg gcgcagggcg atggcagtac actgcaggtc  480
tatgatactc ccttagggaa actcggtggc ttaatttgct gggagaatta tatgccactg  540
gcccgctata ccatgtatgc ctggggcaca caaatctatg tcgcggcaac gtgggatcgg  600
ggtcagccct ggctctctac tttacgccac attgccaaag aaggcagggt gtatgtgatt  660
ggttgttgta tcgcgatgcg taaagacgat atcccagacc attatacaat gaaacagaag  720
ttttactcag atgcagatga gtggattaat attggcgata gtgcgattgt taatcccgaa  780
gggcaattta tcgctggacc ggtgcgcaag caggaagaga ttctctatgc ggagattgat  840
ccgcgcatgg tccaagggcc gaagtggatg ctcgacgtgg cgggacatta tgccaggccg  900
gatgtgttcg aactgattgt ccacacggat attcgaagga tgatcaaatc ggaaaagaat  960
tcataa  966



<210> 72 
<211> 321 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 72 



Met Pro Asn Glu Asn Thr Asn Ala Thr Phe Lys Val Ala Ala Val Gln
1 5 10 15

Ala Ser Pro Val Phe Leu Asp Arg Ala Ala Thr Ile Asp Lys Ala Cys
20 25 30

Asp Leu Ile Ala Ala Ala Gly Gly Glu Gly Ala Arg Leu Ile Val Phe
35 40 45

Pro Glu Ala Phe Ile Pro Ser Tyr Pro Asp Trp Val Trp Ala Ile Pro
50 55 60

Ser Gly Glu Glu Gly Val Leu Asn Glu Leu Tyr Ala Asp Leu Leu Ser
65 70 75 80

Asn Ser Val Thr Ile Pro Ser Asp Ser Thr Asp Lys Leu Cys Arg Ala
85 90 95

Ala Arg Leu Ala Asn Ala Tyr Val Val Met Gly Met Ser Glu Arg Asn
100 105 110

Ala Glu Ala Ser Gly Ala Ser Met Tyr Asn Thr Leu Leu Tyr Ile Asp
115 120 125

Ala Gln Gly Glu Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly
130 135 140

Gly Glu Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln Val
145 150 155 160

Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn
165 170 175

Tyr Met Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gln Ile
180 185 190

Tyr Val Ala Ala Thr Trp Asp Arg Gly Gln Pro Trp Leu Ser Thr Leu
195 200 205

Arg His Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile
210 215 220

Ala Met Arg Lys Asp Asp Ile Pro Asp His Tyr Thr Met Lys Gln Lys
225 230 235 240

Phe Tyr Ser Asp Ala Asp Glu Trp Ile Asn Ile Gly Asp Ser Ala Ile
245 250 255

Val Asn Pro Glu Gly Gln Phe Ile Ala Gly Pro Val Arg Lys Gln Glu
260 265 270

Glu Ile Leu Tyr Ala Glu Ile Asp Pro Arg Met Val Gln Gly Pro Lys
275 280 285

Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu
290 295 300

Leu Ile Val His Thr Asp Ile Arg Arg Met Ile Lys Ser Glu Lys Asn
305 310 315 320

Ser



<210> 73 
<211> 1035 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 73 

atgacagcaa tagactcaac gtttaaagtc gccgccgttc aggctgcgcc ggtcttcctc  60
aatcgcgacg caaccgtgga gaaggcgtgc cggctgatca agtccgcggc agagggaggc  120
gcgcgtctga tcgttttccc ggaagcgttc ataccggcct acccggactg ggtgtggacg  180
gtccctgccg gtgagcaagg cctgctcaac gacctctacg gccaactcgt cgaccagtcc  240
gtgacgattc ccagcgacat caccaccgag ttatgtaacg cggcacgggc agcaaacgcc  300
tatgtcgtga ttggtgtcaa cgagcgcaac gcggaggcaa gcaatggaag cctctacaac  360
tcgctcctct acatcgacgc aaacggcaaa attctcggta agcaccgcaa gctcgttccc  420
acaggcggag aacggctcgt gtgggcgcag ggcgatggca gcacgctcga agcctacgac  480
acggagctgg gcaaactcgg cggtctcatt tgctgggaga actatatgcc gctggcacgc  540
tacgcgatgt acgcatgggg agtgcagctc tatgtcgccg cgacctggga ccgtggcggc  600
ccctggactg ccacgctgcg tcatgtcgcc aaggaaggtc agatgtacgt catcgggtgc  660
tgccaggccc tgcacaagga tgacctgccg gagctagacg ggctgaagga gaagtactac  720
gccaacgcac gagagtggat caatgttggc gacagcgcta ttgtcggccc ggacggacaa  780
ttccttgtcg agcccgtccg aatgcgggaa gacatcctct acgccgaggt ggacactcgc  840
aacttccgcg gcccgaagtg gatgttcgac gcggctggac actacgcgcg tcccgacatt  900
ttccaactca cagtgaaccg cgagcagcgg ccgatggtcc gcgtcgtcgg tgacagcagt  960
gaccagaagg agcggccgct cccggacgac ggacggctct ggtacgccta cagcaccaat  1020
cagcaccacg actga  1035

<210> 74
<211> 344
<212> PRT
<213> Unknown
<220> 

<223> Obtained from an environmental sample 
<400> 74 

Met Thr Ala Ile Asp Ser Thr Phe Lys Val Ala Ala Val Gln Ala Ala
1 5 10 15

Pro Val Phe Leu Asn Arg Asp Ala Thr Val Glu Lys Ala Cys Arg Leu
20 25 30

Ile Lys Ser Ala Ala Glu Gly Gly Ala Arg Leu Ile Val Phe Pro Glu
35 40 45

Ala Phe Ile Pro Ala Tyr Pro Asp Trp Val Trp Thr Val Pro Ala Gly
50 55 60

Glu Gln Gly Leu Leu Asn Asp Leu Tyr Gly Gln Leu Val Asp Gln Ser
65 70 75 80

Val Thr Ile Pro Ser Asp Ile Thr Thr Glu Leu Cys Asn Ala Ala Arg
85 90 95

Ala Ala Asn Ala Tyr Val Val Ile Gly Val Asn Glu Arg Asn Ala Glu
100 105 110

Ala Ser Asn Gly Ser Leu Tyr Asn Ser Leu Leu Tyr Ile Asp Ala Asn
115 120 125

Gly Lys Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu
130 135 140

Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Glu Ala Tyr Asp
145 150 155 160

Thr Glu Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met
165 170 175

Pro Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Val Gln Leu Tyr Val
180 185 190

Ala Ala Thr Trp Asp Arg Gly Gly Pro Trp Thr Ala Thr Leu Arg His
195 200 205

Val Ala Lys Glu Gly Gln Met Tyr Val Ile Gly Cys Cys Gln Ala Leu
210 215 220

His Lys Asp Asp Leu Pro Glu Leu Asp Gly Leu Lys Glu Lys Tyr Tyr
225 230 235 240

Ala Asn Ala Arg Glu Trp Ile Asn Val Gly Asp Ser Ala Ile Val Gly
245 250 255

Pro Asp Gly Gln Phe Leu Val Glu Pro Val Arg Met Arg Glu Asp Ile
260 265 270

Leu Tyr Ala Glu Val Asp Thr Arg Asn Phe Arg Gly Pro Lys Trp Met
275 280 285

Phe Asp Ala Ala Gly His Tyr Ala Arg Pro Asp Ile Phe Gln Leu Thr
290 295 300

Val Asn Arg Glu Gln Arg Pro Met Val Arg Val Val Gly Asp Ser Ser
305 310 315 320

Asp Gln Lys Glu Arg Pro Leu Pro Asp Asp Gly Arg Leu Trp Tyr Ala
325 330 335

Tyr Ser Thr Asn Gln His His Asp
340

<210> 75 
<211> 1125
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 75 

atgagcacca ttgttaaagc cgctgcggtt caaatcagcc cagtcctcta cagccgcgag  60
gggacagtcg caaaagttgt gcggaagatc cacgaacttg gccaaaaggg ggtgcggttc  120
gccacgttcc cggagaccgt ggttccctac tatccatatt tttccgccgt ccagaccccc  180
attcaactat tgtccggaac cgagtacctg aagttgctcg accaaggcgt gaccgtgccg  240
tccacgacta ccgacgcaat cggggaggct gcccggaacg ccggcatggt tgtatctatc  300
ggcgtgaatg agcgtgacgg cgggaccctg tacaacgcgc agttgctctt cgatgcggat  360
gggaccttga ttcagcgtcg ccgcaagatc actcctacgc attacgagcg catgatctgg  420
ggccagggag atggttcggg tttgcgggcc gtcaagagcc aggttggtcg tattggccaa  480
cttgcatgct ttgagcacaa caacccactg gcgcgttacg cgatgatggc cgatggcgag  540
caaatccatt cggccatgta tccaggttcc gcgttcggcg aggggttcgc ggaaaagatg  600
gaaatcaata tccgccagca tgcgttggag tccgggtgct tcgttgtgaa tgcaacggcc  660
tggcttgacg ccagccagca ggcacaaatc atgaatgaca cgggttgcca aatcggtccg  720
atctcgggcg gttgctttac cacgatcgta acacccgacg gcacgtttct gggcgaacct  780
ctccggtcgg gtgagggcga ggtcatcgcc gatctcgatt tcaagctgat cgacaaacgc  840
aagatgttga tggactcgcg cggccactac agtcgcccgg aattgctcag tctgctgatc  900
gaccgcaccc ccaccgcgca cattcatgag cgaggtgcgc cgcagacgtc aggcgctgtg  960
caagaggcga cgaaagtggg ttcacacgcg ccgctcctgc gtgacggaca atgggatcag  1020
ctcaatgcgg gagcgggccg acatacaggg aatggagaag cacagataga aatcatggcc  1080
gcggcccact cgggcacccg tggaattgaa gcgaagggag cctaa  1125

<210> 76
<211> 374
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 76

Met Ser Thr Ile Val Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu
1 5 10 15

Tyr Ser Arg Glu Gly Thr Val Ala Lys Val Val Arg Lys Ile His Glu
20 25 30

Leu Gly Gln Lys Gly Val Arg Phe Ala Thr Phe Pro Glu Thr Val Val
35 40 45

Pro Tyr Tyr Pro Tyr Phe Ser Ala Val Gln Thr Pro Ile Gln Leu Leu
50 55 60

Ser Gly Thr Glu Tyr Leu Lys Leu Leu Asp Gln Gly Val Thr Val Pro
65 70 75 80

Ser Thr Thr Thr Asp Ala Ile Gly Glu Ala Ala Arg Asn Ala Gly Met
85 90 95

Val Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn
100 105 110

Ala Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg
115 120 125

Lys Ile Thr Pro Thr His Tyr Glu Arg Met Ile Trp Gly Gln Gly Asp
130 135 140

Gly Ser Gly Leu Arg Ala Val Lys Ser Gln Val Gly Arg Ile Gly Gln
145 150 155 160

Leu Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met Met
165 170 175

Ala Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Ala Phe
180 185 190

Gly Glu Gly Phe Ala Glu Lys Met Glu Ile Asn Ile Arg Gln His Ala
195 200 205

Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala
210 215 220

Ser Gln Gln Ala Gln Ile Met Asn Asp Thr Gly Cys Gln Ile Gly Pro
225 230 235 240

Ile Ser Gly Gly Cys Phe Thr Thr Ile Val Thr Pro Asp Gly Thr Phe
245 250 255

Leu Gly Glu Pro Leu Arg Ser Gly Glu Gly Glu Val Ile Ala Asp Leu
260 265 270

Asp Phe Lys Leu Ile Asp Lys Arg Lys Met Leu Met Asp Ser Arg Gly
275 280 285

His Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro
290 295 300

Thr Ala His Ile His Glu Arg Gly Ala Pro Gln Thr Ser Gly Ala Val
305 310 315 320

Gln Glu Ala Thr Lys Val Gly Ser His Ala Pro Leu Leu Arg Asp Gly
325 330 335

Gln Trp Asp Gln Leu Asn Ala Gly Ala Gly Arg His Thr Gly Asn Gly
340 345 350

Glu Ala Gln Ile Glu Ile Met Ala Ala Ala His Ser Gly Thr Arg Gly
355 360 365

Ile Glu Ala Lys Gly Ala
370



<210> 77 
<211> 1056 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 77 

atgccaaccc ccagcgatca tttcaaaatc gccgctgttc aggcctcgcc cgtgtttctg  60
gaccgggagg ccactgtgga aaaggcctgc cggttgatcg ccgaagccgc aaagcagggc  120
gtccgcctca tcgtctttcc ggaatcgttc atcccgacct acccggactg ggtatgggcc  180
gttcccccgg gaagggaaag aatcctgaac cagctgtatt ctgaattcct ggccaatgcc  240
gtcgatgttc ccggcgcggc gaccgaacaa cttgcccagg ctgcacgaat ggccggcgcc  300
tatgtgatta tgggcgtcac cgaaagagac acctcggcca gcggggccag cctctacaac  360
accctgctct acttcagccc cgaaggcatc ctaatgggca aacaccggaa gctggttccc  420
acggggggcg aacggctggt ctgggcctac ggagacggca gcacgctgga ggtctacgac  480
actccgctgg gaaagatcgg cgggctgatc tgctgggaga actacatgcc cctggcccgg  540
tacacgatgt acgcctgggg cacccagatt tacatcgccg ccacctggga ccgcggggaa  600
ccgtggctct ccaccctgcg gcatatcgca aaggaaggaa gggtctacgt catcgggtgc  660
tgcatcgccc tgcgccaggg ggatatcccg gaccggttcg agtacaaggg aaaattttat  720
tccgggtccc gggagtggat caatgagggc gacagcgcca tcgtgaaccc ggacggggaa  780
ttcatcgccg ggccggtgcg gacgaaggag gagatcctgt atgccgagat agacccccgg  840
cagatgcggg gccccaagtg gatgctcgat gtggccggtc attacgcccg gccggatatc  900
ttcgagctca tcgtccaccg gaatccccac ccgatgatca aaatcgccga agacaggggc  960
acggggatcg cctcaagttt gattcgcccc cgccctaacc ttcccccatc aagggggagg  1020
aaatcggcaa gaagcaaacg caagcccaaa aaatga  1056



<210> 78 
<211> 351 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 78 
Met Pro Thr Pro Ser Asp His Phe Lys Ile Ala Ala Val Gln Ala Ser
1               5                   10                  15

Pro Val Phe Leu Asp Arg Glu Ala Thr Val Glu Lys Ala Cys Arg Leu
            20                  25                  30

Ile Ala Glu Ala Ala Lys Gln Gly Val Arg Leu Ile Val Phe Pro Glu
        35                  40                  45

Ser Phe Ile Pro Thr Tyr Pro Asp Trp Val Trp Ala Val Pro Pro Gly
    50                  55                  60

Arg Glu Arg Ile Leu Asn Gln Leu Tyr Ser Glu Phe Leu Ala Asn Ala
65                  70                  75                  80

Val Asp Val Pro Gly Ala Ala Thr Glu Gln Leu Ala Gln Ala Ala Arg
                85                  90                  95

Met Ala Gly Ala Tyr Val Ile Met Gly Val Thr Glu Arg Asp Thr Ser
            100                 105                 110

Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Phe Ser Pro Glu
        115                 120                 125

Gly Ile Leu Met Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu
    130                 135                 140

Arg Leu Val Trp Ala Tyr Gly Asp Gly Ser Thr Leu Glu Val Tyr Asp
145                 150                 155                 160

Thr Pro Leu Gly Lys Ile Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met
                165                 170                 175

Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gln Ile Tyr Ile
            180                 185                 190

Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His
        195                 200                 205

Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile Ala Leu
    210                 215                 220

Arg Gln Gly Asp Ile Pro Asp Arg Phe Glu Tyr Lys Gly Lys Phe Tyr
225                 230                 235                 240

Ser Gly Ser Arg Glu Trp Ile Asn Glu Gly Asp Ser Ala Ile Val Asn
                245                 250                 255

Pro Asp Gly Glu Phe Ile Ala Gly Pro Val Arg Thr Lys Glu Glu Ile
            260                 265                 270

Leu Tyr Ala Glu Ile Asp Pro Arg Gln Met Arg Gly Pro Lys Trp Met
        275                 280                 285

Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Ile Phe Glu Leu Ile
    290                 295                 300

Val His Arg Asn Pro His Pro Met Ile Lys Ile Ala Glu Asp Arg Gly
305                 310                 315                 320

Thr Gly Ile Ala Ser Ser Leu Ile Arg Pro Arg Pro Asn Leu Pro Pro
                325                 330                 335

Ser Arg Gly Arg Lys Ser Ala Arg Ser Lys Arg Lys Pro Lys Lys
            340                 345                 350







<210> 79 
<211> 990 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 79 

atgacgaaga aaagcggccg cgattcgttt cgggtcgctg cggtccaggc ctcgtccgtc 60
tacctggatc gggaacggag catcgagaaa gcgtgccggc tgatcgacga cgcgggacga 120
aacgacgccg acctcgtcgt gttccccgaa gccttcgtgc ccggataccc actgtgggtg 180
tggctcgttc cgccggggcg caccgcagac ttgcgctccg cttatgcgac gctccacgcc 240
aacgcgatca gcattccgga cgactccacc gatcggctgt gcgccgccgc aaaagacgcc 300
ggcgtcgccg tcgcgatcgg cgtcaacgaa cgcaacaccg aagcgagcgg catgagcctg 360
ttcaacacgc tgctctatat cggagcggac ggccggattc tcggaaaaca ccggaagctg 420
gtaccgaccg gcggcgaacg gctcgtctgg gcatctggcg acggcagcga cctcgaggtc 480
tactcgctgc cgttcggtcg cgtaagcgga ctgatctgct gggagcacta catgccgctc 540
gcccggtatg cgctcgccgc gtggggcgaa caggtgcacg tcgctccaac ctgggatcgt 600
ggcgagccgt ggctgtccac gctaaggcac atcgcgaagg aaggccgcgt tctcgtcgtc 660
ggctgctgtc aagccgtgcg caaggacgac atccctgaca cgctcgcgtt caagtccaaa 720
tacctcgcag acgtggacgg ctggatcaac ccaggtggca gcgtcatcat caatcctgac 780
ggcaaggtcg tcgcgggacc ggcgatggaa accgaaactg tactgtacgc ggaccttcgc 840
accgagcagc tcgtcggacc gcgctggcag ctcgacgtcg gcggacatta cgctcgtccg 900
gacgtcttcg agctcgtcgt ccatcggcat ccgaagccgt tgattcggac agcgaccggt 960
gtcaggcgcc gcaagcgtgc acgtcgctaa 990

<210> 80 
<211> 329 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 80 



Met Thr Lys Lys Ser Gly Arg Asp Ser Phe Arg Val Ala Ala Val Gln
1               5                   10                  15

Ala Ser Ser Val Tyr Leu Asp Arg Glu Arg Ser Ile Glu Lys Ala Cys
            20                  25                  30

Arg Leu Ile Asp Asp Ala Gly Arg Asn Asp Ala Asp Leu Val Val Phe
        35                  40                  45

Pro Glu Ala Phe Val Pro Gly Tyr Pro Leu Trp Val Trp Leu Val Pro
    50                  55                  60

Pro Gly Arg Thr Ala Asp Leu Arg Ser Ala Tyr Ala Thr Leu His Ala
65                  70                  75                  80

Asn Ala Ile Ser Ile Pro Asp Asp Ser Thr Asp Arg Leu Cys Ala Ala
                85                  90                  95

Ala Lys Asp Ala Gly Val Ala Val Ala Ile Gly Val Asn Glu Arg Asn
            100                 105                 110

Thr Glu Ala Ser Gly Met Ser Leu Phe Asn Thr Leu Leu Tyr Ile Gly
        115                 120                 125

Ala Asp Gly Arg Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly
    130                 135                 140

Gly Glu Arg Leu Val Trp Ala Ser Gly Asp Gly Ser Asp Leu Glu Val
145                 150                 155                 160

Tyr Ser Leu Pro Phe Gly Arg Val Ser Gly Leu Ile Cys Trp Glu His
                165                 170                 175

Tyr Met Pro Leu Ala Arg Tyr Ala Leu Ala Ala Trp Gly Glu Gln Val
            180                 185                 190

His Val Ala Pro Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu
        195                 200                 205

Arg His Ile Ala Lys Glu Gly Arg Val Leu Val Val Gly Cys Cys Gln
    210                 215                 220

Ala Val Arg Lys Asp Asp Ile Pro Asp Thr Leu Ala Phe Lys Ser Lys
225                 230                 235                 240

Tyr Leu Ala Asp Val Asp Gly Trp Ile Asn Pro Gly Gly Ser Val Ile
                245                 250                 255

Ile Asn Pro Asp Gly Lys Val Val Ala Gly Pro Ala Met Glu Thr Glu
            260                 265                 270

Thr Val Leu Tyr Ala Asp Leu Arg Thr Glu Gln Leu Val Gly Pro Arg
        275                 280                 285

Trp Gln Leu Asp Val Gly Gly His Tyr Ala Arg Pro Asp Val Phe Glu
    290                 295                 300

Leu Val Val His Arg His Pro Lys Pro Leu Ile Arg Thr Ala Thr Gly
305                 310                 315                 320

Val Arg Arg Arg Lys Arg Ala Arg Arg
                325



<210> 81 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 
<223> Obtained from an environmental sample



<400> 81 

atgaaagtcg tcaaagccgc cgctgtccag ttcagcccgg tgctctatag ccgcgaagcg 60
accgtcgcca aggtcgtccg gaaaatccac gagctcggtc agaaaggcgt gcagttcgcc 120
acctttcctg aaacggtcgt gccttattac ccttacttcg cggccgtcca gacgggcatc 180
gagctcttgt cgggcaccga acatctgcgc ctgctcgaac aggccgtgac tgtgccctcc 240
gctgcgaccg atgcaatcgg cgaagccgcg cgacaggccg gcatggtcgt gtccatcggc 300
gtcaatgagc gtgacggcgg cacgctttac aacacgcaac tgctcttcga tgccgacggt 360
acgctgatcc agcgccgccg caagatcacg ccgacccatt tcgaacgcat gatctggggg 420
cagggagatg gctcgggctt gcgtgcagtc gacagcgcag tcggccgcat cggccagctc 480
gcatgcttcg agcacaacaa cccgcttgca cgttacgcaa tgatcgccga cggcgagcag 540
atccattcag cgatgtaccc tggctcggcc tttggcgagg gcttcgccca gcgtatggag 600
atcaacatcc gccagcatgc gctcgagtcc gccgctttcg tcgtcaacgc aacggcgtgg 660
cttgacgccg accagcaggc gcaaatcatg aaggacaccg gttgtggaat cggtccgatc 720
tcgggcggct gcttcaccac gatcgtttct cctgacggta tgctgatggc cgatccgctt 780
cgctcgggcg aaggcgaagt gattgtcgat ctcgacttca cgcagatcga ccgccgcaag 840
atgctgatgg actcggccgg ccactacaac cgccctgaac tgctgagtct gatgatcgac 900
cgtacgccgg ctgcgcatgt tcacgaacgc gcttcgcgcc cgatgaccgt cgacgaccag 960
agttccggcg atctgcgcac ccaggttgca tga 993



<210> 82 
<211> 330 
<212> PRT
<213> Unknown 



<220> 

<223> Obtained from an environmental sample



<400> 82 

Met Lys Val Val Lys Ala Ala Ala Val Gln Phe Ser Pro Val Leu Tyr
1               5                   10                  15

Ser Arg Glu Ala Thr Val Ala Lys Val Val Arg Lys Ile His Glu Leu
            20                  25                  30

Gly Gln Lys Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
        35                  40                  45

Tyr Tyr Pro Tyr Phe Ala Ala Val Gln Thr Gly Ile Glu Leu Leu Ser
    50                  55                  60

Gly Thr Glu His Leu Arg Leu Leu Glu Gln Ala Val Thr Val Pro Ser
65                  70                  75                  80

Ala Ala Thr Asp Ala Ile Gly Glu Ala Ala Arg Gln Ala Gly Met Val
                85                  90                  95

Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr
            100                 105                 110

Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
        115                 120                 125

Ile Thr Pro Thr His Phe Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
    130                 135                 140

Ser Gly Leu Arg Ala Val Asp Ser Ala Val Gly Arg Ile Gly Gln Leu
145                 150                 155                 160

Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met Ile Ala
                165                 170                 175

Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Ala Phe Gly
            180                 185                 190

Glu Gly Phe Ala Gln Arg Met Glu Ile Asn Ile Arg Gln His Ala Leu
        195                 200                 205

Glu Ser Ala Ala Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
    210                 215                 220

Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Gly Ile Gly Pro Ile
225                 230                 235                 240

Ser Gly Gly Cys Phe Thr Thr Ile Val Ser Pro Asp Gly Met Leu Met
                245                 250                 255

Ala Asp Pro Leu Arg Ser Gly Glu Gly Glu Val Ile Val Asp Leu Asp
            260                 265                 270

Phe Thr Gln Ile Asp Arg Arg Lys Met Leu Met Asp Ser Ala Gly His
        275                 280                 285

Tyr Asn Arg Pro Glu Leu Leu Ser Leu Met Ile Asp Arg Thr Pro Ala
    290                 295                 300

Ala His Val His Glu Arg Ala Ser Arg Pro Met Thr Val Asp Asp Gln
305                 310                 315                 320

Ser Ser Gly Asp Leu Arg Thr Gln Val Ala
                325                 330



<210> 83 
<211> 1071 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 83 

atgatgagtt cagcccgtgt aataaaactc gccgcagcac agctttcacc tgtgctgccg 60
ggggagtcca caaatagccg cgacggcacc attgccaaag tcgtcgcggc gattgcggag 120
gctgcgcgcg ccggcgcgca gctgatcgtg tttcccgaaa cggtggtgcc gtattacccg 180
tatttctcgt tcattacgcc ggcggtgacg atgggggcgg agcatttgcg cttgtacgat 240
cagtctgtcg tggtgccgag cgccgccact gatactgttg ccgccgctgc aaaaaaacac 300
agcatggtgg tcgtgctcgg tattaacgaa cgcgatcacg gcacgctcta caacgcgcaa 360
ttaattttcg atgcgagcgg cgaattatta ttaaaacgcc gaaaaattac cccgacctat 420
cacgagcgca tggtgtgggg tcagggcgac ggcagcggtt tgaaaaccgt cgacaccgcg 480
atcggccgtg tcggtgcgct cgcctgctgg gaacattaca acccattggc gcgttacagc 540
ctgatggccc agcacgaaga aattcattgc agtcaatttc cggggtcatt ggtcgggcca 600
attttcgccg agcaaatgga agtgacaatg cgccaccacg cgctcgaatc cggttgcttc 660
gtcgttaatg caacggcgtg gttatcggaa gcgcaaattc aatcgatcag cagcgatccc 720
gcgatgcaaa aagcactgcg cggcggttgc tacaccgcaa ttatttcgcc cgaaggcaaa 780
catctgtgcg agccgctacg cgaaggtgaa ggtttgattt ttgccgaagc cgatatggcg 840
ctcattacca aacgcaaacg catgatggat tcggttggtc attacgcgcg acccgaattg 900
ctgtcgctgt taatcgacca tcgcgccacc acaccattgc atagcgtcac cgcgagtgat 960
gccgccgccg taaaaaatac tcggagttcc gctcatgaat cagccgatag tgaaaccatc 1020
cgcgagtcag ttaataacgg aactccaatc gcacggcttg cgcctagttg a 1071



<210> 84 
<211> 356 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 84 



Met Met Ser Ser Ala Arg Val Ile Lys Leu Ala Ala Ala Gln Leu Ser
1               5                   10                  15

Pro Val Leu Pro Gly Glu Ser Thr Asn Ser Arg Asp Gly Thr Ile Ala
            20                  25                  30

Lys Val Val Ala Ala Ile Ala Glu Ala Ala Arg Ala Gly Ala Gln Leu
        35                  40                  45

Ile Val Phe Pro Glu Thr Val Val Pro Tyr Tyr Pro Tyr Phe Ser Phe
    50                  55                  60

Ile Thr Pro Ala Val Thr Met Gly Ala Glu His Leu Arg Leu Tyr Asp
65                  70                  75                  80

Gln Ser Val Val Val Pro Ser Ala Ala Thr Asp Thr Val Ala Ala Ala
                85                  90                  95

Ala Lys Lys His Ser Met Val Val Val Leu Gly Ile Asn Glu Arg Asp
            100                 105                 110

His Gly Thr Leu Tyr Asn Ala Gln Leu Ile Phe Asp Ala Ser Gly Glu
        115                 120                 125

Leu Leu Leu Lys Arg Arg Lys Ile Thr Pro Thr Tyr His Glu Arg Met
    130                 135                 140

Val Trp Gly Gln Gly Asp Gly Ser Gly Leu Lys Thr Val Asp Thr Ala
145                 150                 155                 160

Ile Gly Arg Val Gly Ala Leu Ala Cys Trp Glu His Tyr Asn Pro Leu
                165                 170                 175

Ala Arg Tyr Ser Leu Met Ala Gln His Glu Glu Ile His Cys Ser Gln
            180                 185                 190

Phe Pro Gly Ser Leu Val Gly Pro Ile Phe Ala Glu Gln Met Glu Val
        195                 200                 205

Thr Met Arg His His Ala Leu Glu Ser Gly Cys Phe Val Val Asn Ala
    210                 215                 220

Thr Ala Trp Leu Ser Glu Ala Gln Ile Gln Ser Ile Ser Ser Asp Pro
225                 230                 235                 240

Ala Met Gln Lys Ala Leu Arg Gly Gly Cys Tyr Thr Ala Ile Ile Ser
                245                 250                 255

Pro Glu Gly Lys His Leu Cys Glu Pro Leu Arg Glu Gly Glu Gly Leu
            260                 265                 270

Ile Phe Ala Glu Ala Asp Met Ala Leu Ile Thr Lys Arg Lys Arg Met
        275                 280                 285

Met Asp Ser Val Gly His Tyr Ala Arg Pro Glu Leu Leu Ser Leu Leu
    290                 295                 300

Ile Asp His Arg Ala Thr Thr Pro Leu His Ser Val Thr Ala Ser Asp
305                 310                 315                 320

Ala Ala Ala Val Lys Asn Thr Arg Ser Ser Ala His Glu Ser Ala Asp
                325                 330                 335

Ser Glu Thr Ile Arg Glu Ser Val Asn Asn Gly Thr Pro Ile Ala Arg
            340                 345                 350

Leu Ala Pro Ser
        355







<210> 85 
<211> 1014 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 85 

atgggtctgg ttcatcagaa atacaaggtt gcggtggttc aggcggcgcc ggtctttctc 60
gacctcgatg cgacggtgga caagacgatc gccctgatcg agcaggccgc agcacagggc 120
gcgaagctga tcgcgtttcc cgagaccttc attcccggat atccgtggca gatctggctt 180
ggggcgcccg cctgggcgat cggccgtggc ttcgtgcagc gctatttcga taactcgttg 240
tcatttgaca gcccgcaggc cgaaaaaatt cgcaaggccg tcaagcgcgc caagctgacc 300
gcggtgatcg gcgtctccga acgcgacggc ggcagcctct atatcggcca atggctgatc 360
ggtcccgacg gcgagaccat tgcgaagcgc cgcaagctgc ggccgaccca tgccgaacgc 420
accgtgttcg gcgagggcga cggcagcgac ctcgccgtcc atgatcgcgc cgacgtggga 480
cggctcggtg caatgtgctg ctgggagcat ctgcagccgc tgtcgaaata cgcgatgtac 540
gcccagaacg agcaggttca cgtcggcgcc tggccgagct tctcattgta cgacccattc 600
gcccatgcgc ttggctggga agtaaacaac gcggcgagca aggtttatgc tgtcgagggc 660
tcatgtttct tcctcggccc gtgcgcggtg gtctcgcagg ccatgatcga cgagctctgc 720
gattcccccg aaaagcacgc cttcctgcac gctggcggcg gccacgcggt aatctatggg 780
ccggacggga gttcgcttgc cgacaaactt ccacccgatc aggagggcat tctgtatgcc 840
gatatcgatc tcggcatgat cggcgtggca aagaacgccg ccgaccccgc aggacactat 900
tccaggccgg acgtcacgcg gctgctgctc aacacttccc gcgccaatcg cgtcgagcat 960
ttttcattgc cgatcgatgc cgaggtcatg agcgaaatca gacttcaggc ctga 1014

<210> 86
<211> 337 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 86 

Met Gly Leu Val His Gln Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1               5                   10                  15

Pro Val Phe Leu Asp Leu Asp Ala Thr Val Asp Lys Thr Ile Ala Leu
            20                  25                  30

Ile Glu Gln Ala Ala Ala Gln Gly Ala Lys Leu Ile Ala Phe Pro Glu
        35                  40                  45

Thr Phe Ile Pro Gly Tyr Pro Trp Gln Ile Trp Leu Gly Ala Pro Ala
    50                  55                  60

Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65                  70                  75                  80

Ser Phe Asp Ser Pro Gln Ala Glu Lys Ile Arg Lys Ala Val Lys Arg
                85                  90                  95

Ala Lys Leu Thr Ala Val Ile Gly Val Ser Glu Arg Asp Gly Gly Ser
            100                 105                 110

Leu Tyr Ile Gly Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
        115                 120                 125

Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Phe Gly
    130                 135                 140

Glu Gly Asp Gly Ser Asp Leu Ala Val His Asp Arg Ala Asp Val Gly
145                 150                 155                 160

Arg Leu Gly Ala Met Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
                165                 170                 175

Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Gly Ala Trp Pro
            180                 185                 190

Ser Phe Ser Leu Tyr Asp Pro Phe Ala His Ala Leu Gly Trp Glu Val
        195                 200                 205

Asn Asn Ala Ala Ser Lys Val Tyr Ala Val Glu Gly Ser Cys Phe Phe
    210                 215                 220

Leu Gly Pro Cys Ala Val Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225                 230                 235                 240

Asp Ser Pro Glu Lys His Ala Phe Leu His Ala Gly Gly Gly His Ala
                245                 250                 255

Val Ile Tyr Gly Pro Asp Gly Ser Ser Leu Ala Asp Lys Leu Pro Pro
            260                 265                 270

Asp Gln Glu Gly Ile Leu Tyr Ala Asp Ile Asp Leu Gly Met Ile Gly
        275                 280                 285

Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
    290                 295                 300

Val Thr Arg Leu Leu Leu Asn Thr Ser Arg Ala Asn Arg Val Glu His
305                 310                 315                 320

Phe Ser Leu Pro Ile Asp Ala Glu Val Met Ser Glu Ile Arg Leu Gln
                325                 330                 335

Ala



<210> 87 
<211> 1062 
<212> DNA 
<213> Unknown 

<220> 
<223> Obtained from an environmental sample



<400> 87 

atggcggaat cgaagctgaa ggtcgccgca attcaagttg cgcccgtgtt catggatcgc 60
gatgccacga tcgcccgcgc ctgcgagcgg atcgccgaag ccgcccgcgc cggcgcggag 120
ttggtggtct ttcccgaggc attcgtgccc gggtatcccg actggatctg ggtggcgcgg 180
ccaagccaac gcaaactgct caatgatctt tacgcgcacc tcgtctcgca gtcggtcgac 240
gtgccgtcgg cctccgtgga tcgtttgcgc gacgcggctc gcgacggcgg ggtcacggtg 300
gtgatcggcg tcaacgagcg caacaccgaa gcgagcggcg cgagcctcta caacaccgcg 360
ctcgtgatcg gtccactggg gcagctgatc ggccgccacc gcaagcttgt gccgaccggg 420
ccggagcgca tggtgtgggc gcagggcgac ggcagcacgc tcgacgtcta cgacacaccc 480
gtcggcaagc tttcgacgtt gatctgctgg gagaactaca tgccgctcgc gcgctacgcc 540
atggcggcgt ggggcgcgcg catccacgtc gccggcacgt gggaccgcgg cgagccgtgg 600
atctcgacca tgcgtcatgt ggcgacggag ggccgcgtat tcgtgattag ctgttgcatg 660
gcgctgcgca aacgagacat tcccgccgag ctcgagttcg cgatgctcta tcccgacggg 720
cgcgaatgga tcaacgccgg tgattcgctg gtcgtgaatc ccgctggcca gatcatcgct 780
gggccgttgc acgagcagga aggaatcctc tacgccgagc tcgagcgcaa tcagatgacc 840
ggtccgcgtt ggatgttcga cgccgccggc cattacgcgc gaccggacgt cttccaactc 900
acggtaaacc gctccccgcg cccgatgctg cgggaggcgg gggcaaagac gagtgaggca 960
aacacgagag atgccgtacc catggacagc acgccctcga gatcgcggcc ccgcgcggtg 1020
gcgcgaaagg ccgcacgcac cggtcgctcc aagcggcggt ga 1062

<210> 88
<211> 353
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample



<400> 88 

Met Ala Glu Ser Lys Leu Lys Val Ala Ala Ile Gln Val Ala Pro Val
1               5                   10                  15

Phe Met Asp Arg Asp Ala Thr Ile Ala Arg Ala Cys Glu Arg Ile Ala
            20                  25                  30

Glu Ala Ala Arg Ala Gly Ala Glu Leu Val Val Phe Pro Glu Ala Phe
        35                  40                  45

Val Pro Gly Tyr Pro Asp Trp Ile Trp Val Ala Arg Pro Ser Gln Arg
    50                  55                  60

Lys Leu Leu Asn Asp Leu Tyr Ala His Leu Val Ser Gln Ser Val Asp
65                  70                  75                  80

Val Pro Ser Ala Ser Val Asp Arg Leu Arg Asp Ala Ala Arg Asp Gly
                85                  90                  95

Gly Val Thr Val Val Ile Gly Val Asn Glu Arg Asn Thr Glu Ala Ser
            100                 105                 110

Gly Ala Ser Leu Tyr Asn Thr Ala Leu Val Ile Gly Pro Leu Gly Gln
        115                 120                 125

Leu Ile Gly Arg His Arg Lys Leu Val Pro Thr Gly Pro Glu Arg Met
    130                 135                 140

Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Asp Val Tyr Asp Thr Pro
145                 150                 155                 160

Val Gly Lys Leu Ser Thr Leu Ile Cys Trp Glu Asn Tyr Met Pro Leu
                165                 170                 175

Ala Arg Tyr Ala Met Ala Ala Trp Gly Ala Arg Ile His Val Ala Gly
            180                 185                 190

Thr Trp Asp Arg Gly Glu Pro Trp Ile Ser Thr Met Arg His Val Ala
        195                 200                 205

Thr Glu Gly Arg Val Phe Val Ile Ser Cys Cys Met Ala Leu Arg Lys
    210                 215                 220

Arg Asp Ile Pro Ala Glu Leu Glu Phe Ala Met Leu Tyr Pro Asp Gly
225                 230                 235                 240

Arg Glu Trp Ile Asn Ala Gly Asp Ser Leu Val Val Asn Pro Ala Gly
                245                 250                 255

Gln Ile Ile Ala Gly Pro Leu His Glu Gln Glu Gly Ile Leu Tyr Ala
            260                 265                 270

Glu Leu Glu Arg Asn Gln Met Thr Gly Pro Arg Trp Met Phe Asp Ala
        275                 280                 285

Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gln Leu Thr Val Asn Arg
    290                 295                 300

Ser Pro Arg Pro Met Leu Arg Glu Ala Gly Ala Lys Thr Ser Glu Ala
305                 310                 315                 320

Asn Thr Arg Asp Ala Val Pro Met Asp Ser Thr Pro Ser Arg Ser Arg
                325                 330                 335

Pro Arg Ala Val Ala Arg Lys Ala Ala Arg Thr Gly Arg Ser Lys Arg
            340                 345                 350

Arg







<210> 89 
<211> 918 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 89 

atgaatacca aagaagtaaa ggtcgcagcc gctcaatttg ccccacattt tctgaatttg 60
agcaaaacgg tggaaaaaac ctgcaacttg atttccgaag caggcaaaaa tggagcaaag 120
ctcattgtat ttccggaagc cttcctctct ggttatcccg attgggtctg gttaattccc 180
aatggaaatt caacaatgct ggatgattta tatcaggaat tggttgagaa cgctgtaaca 240
atccctgatt caacaacaca gaaactctgt caggcagcaa aagatgccgg ggtatatgtc 300
gcagtcggta tccatgaaag aaatgcagaa gcaagtggct tcacactttt caataccctt 360
ctatacatta atgatcaagg cagcatcatt ggaaaacacc gaaaactgat cccaacaggg 420
ggcgaacgcc tggtctgggg gcagggtaat ggggatacgc ttgctgcatt cgatacacac 480
tttggcaaat tgggaggatt gctttgctgg gaaaactaca tgcccctggc tcggcaagct 540
atgtacgcag ttgggactga agtttatgtt gccccaacct gggactccag tgagaattgg 600
ttgctgagta tgcgccatat agccagagag ggcggcatgt ttgtgatcaa tgtttgccag 660
gctgtccgaa aagacgatat tcctgaccgc tatgcattca agcaactcta ttctggtaat 720
tcagaatgga tcaatagcgg caacagttgc atcatcaatc cgcgcggtga aatcattgcc 780
ggaccatcct caaacaggca agaaatactc tacgcagatt tagatctgag tttgattaca 840
aaatctaaac gcatgttcga tgttaccggg cattatgccc ggccggatgt gtttagatat 900
gaaatcaaaa aaagctag 918



<210> 90 
<211> 305 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
<400> 90 

Met Asn Thr Lys Glu Val Lys Val Ala Ala Ala Gln Phe Ala Pro His
1               5                   10                  15

Phe Leu Asn Leu Ser Lys Thr Val Glu Lys Thr Cys Asn Leu Ile Ser
            20                  25                  30

Glu Ala Gly Lys Asn Gly Ala Lys Leu Ile Val Phe Pro Glu Ala Phe
        35                  40                  45

Leu Ser Gly Tyr Pro Asp Trp Val Trp Leu Ile Pro Asn Gly Asn Ser
    50                  55                  60

Thr Met Leu Asp Asp Leu Tyr Gln Glu Leu Val Glu Asn Ala Val Thr
65                  70                  75                  80

Ile Pro Asp Ser Thr Thr Gln Lys Leu Cys Gln Ala Ala Lys Asp Ala
                85                  90                  95

Gly Val Tyr Val Ala Val Gly Ile His Glu Arg Asn Ala Glu Ala Ser
            100                 105                 110

Gly Phe Thr Leu Phe Asn Thr Leu Leu Tyr Ile Asn Asp Gln Gly Ser
        115                 120                 125

Ile Ile Gly Lys His Arg Lys Leu Ile Pro Thr Gly Gly Glu Arg Leu
    130                 135                 140

Val Trp Gly Gln Gly Asn Gly Asp Thr Leu Ala Ala Phe Asp Thr His
145                 150                 155                 160

Phe Gly Lys Leu Gly Gly Leu Leu Cys Trp Glu Asn Tyr Met Pro Leu
                165                 170                 175

Ala Arg Gln Ala Met Tyr Ala Val Gly Thr Glu Val Tyr Val Ala Pro
            180                 185                 190

Thr Trp Asp Ser Ser Glu Asn Trp Leu Leu Ser Met Arg His Ile Ala
        195                 200                 205

Arg Glu Gly Gly Met Phe Val Ile Asn Val Cys Gln Ala Val Arg Lys
    210                 215                 220

Asp Asp Ile Pro Asp Arg Tyr Ala Phe Lys Gln Leu Tyr Ser Gly Asn
225                 230                 235                 240

Ser Glu Trp Ile Asn Ser Gly Asn Ser Cys Ile Ile Asn Pro Arg Gly
                245                 250                 255

Glu Ile Ile Ala Gly Pro Ser Ser Asn Arg Gln Glu Ile Leu Tyr Ala
            260                 265                 270

Asp Leu Asp Leu Ser Leu Ile Thr Lys Ser Lys Arg Met Phe Asp Val
        275                 280                 285

Thr Gly His Tyr Ala Arg Pro Asp Val Phe Arg Tyr Glu Ile Lys Lys
    290                 295                 300

Ser
305

































<210> 91 
<211> 939 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 91 

atgaccaaaa tcgctgtcat tcaagaacct ccggtctatc tgaatctgag taaatcgatg 60
gacagagcgg tcgacttgat tgccaatgct gcaagcaagg ggtgtgagtt gattgtgttt 120
cccgaagcct ggcttgcagg ttaccccacc ttcgtctggc gtcttgcgcc gggcagcgga 180
atgggaaaaa ctgatgagct ttacgcgcgt ttgctcgcca actcggtcga ccgtagcaaa 240
gaggggctta gaccattgca ggaggccgca aaggagcatg gcgttgtcat tgtgctgggt 300
tatcaagagg tggatggcgc gggaagcagc agcacgatct tcaacagctg tgcgattatt 360
gatgcggacg ggcgactggc caacaatcat cgcaagttga tgcccaccaa tccggagagg 420
atggtttggg gttttggcga cggttcaggc ctgaacgtcg ttgacaccgc ggtgggcagg 480
atcggcacgc tgatttgctg ggaaaactac atgccgttag cgcgctacgc gctgtatgtc 540
caaaacatcg aaatctatgt tgccccgact tgggacagtg gtgccatgtg gcaggcgacc 600
ctgcagcata tcgcgcgcga aggtggctgc tgggtcatcg gatgtgcaac gtcgctggaa 660
gcctctgaca tcccggacga cgttccccat cgggatgagc tattcccgaa caaagacgaa 720
tgggtaaacc ctggcgatgc ggtggtttat aagccatttg gcggcattgt ggccggcccc 780
atgcatcagg aaaaggggct tctcatcgca gagttggacg tcgccgctgt tcagtcgtca 840
cgtcggaagt tcgatgcgag cgggcactac gctcgccccg atgtcttcaa actgcatgtg 900
aatcgcaccg cgatgcggcc agttgatttc acgaattag 939

<210> 92
<211> 312
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 92

Met Thr Lys Ile Ala Val Ile Gln Glu Pro Pro Val Tyr Leu Asn Leu
1               5                   10                  15

Ser Lys Ser Met Asp Arg Ala Val Asp Leu Ile Ala Asn Ala Ala Ser
            20                  25                  30

Lys Gly Cys Glu Leu Ile Val Phe Pro Glu Ala Trp Leu Ala Gly Tyr
        35                  40                  45

Pro Thr Phe Val Trp Arg Leu Ala Pro Gly Ser Gly Met Gly Lys Thr
    50                  55                  60

Asp Glu Leu Tyr Ala Arg Leu Leu Ala Asn Ser Val Asp Arg Ser Lys
65                  70                  75                  80

Glu Gly Leu Arg Pro Leu Gln Glu Ala Ala Lys Glu His Gly Val Val
                85                  90                  95

Ile Val Leu Gly Tyr Gln Glu Val Asp Gly Ala Gly Ser Ser Ser Thr
            100                 105                 110

Ile Phe Asn Ser Cys Ala Ile Ile Asp Ala Asp Gly Arg Leu Ala Asn
        115                 120                 125

Asn His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val Trp Gly
    130                 135                 140

Phe Gly Asp Gly Ser Gly Leu Asn Val Val Asp Thr Ala Val Gly Arg
145                 150                 155                 160

Ile Gly Thr Leu Ile Cys Trp Glu Asn Tyr Met Pro Leu Ala Arg Tyr
                165                 170                 175

Ala Leu Tyr Val Gln Asn Ile Glu Ile Tyr Val Ala Pro Thr Trp Asp
            180                 185                 190

Ser Gly Ala Met Trp Gln Ala Thr Leu Gln His Ile Ala Arg Glu Gly
        195                 200                 205

Gly Cys Trp Val Ile Gly Cys Ala Thr Ser Leu Glu Ala Ser Asp Ile
    210                 215                 220

Pro Asp Asp Val Pro His Arg Asp Glu Leu Phe Pro Asn Lys Asp Glu
225                 230                 235                 240

Trp Val Asn Pro Gly Asp Ala Val Val Tyr Lys Pro Phe Gly Gly Ile
                245                 250                 255

Val Ala Gly Pro Met His Gln Glu Lys Gly Leu Leu Ile Ala Glu Leu
            260                 265                 270

Asp Val Ala Ala Val Gln Ser Ser Arg Arg Lys Phe Asp Ala Ser Gly
        275                 280                 285

His Tyr Ala Arg Pro Asp Val Phe Lys Leu His Val Asn Arg Thr Ala
    290                 295                 300

Met Arg Pro Val Asp Phe Thr Asn
305                 310

<210> 93 
<211> 978 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 93

atgcccatca tcaaagccgc tgccgtgcaa atcagcccgg tgctttacag tcgcgaaggc 60

accgtggaca aggtctgtca acagatcatc gacctcggtc ggcaaggcgt gcagttcgcc 120

gtatttccgg aaacggtggt gccttactac ccgtactttt cgtttgtgca gccggccttt 180

gcca'tgggS acagcacct caagttgctg gatcaatcgg tgacagtgcc gtcggccgcc 240 

accttggcca tcggtgaagc ttgcaagcaa gcagggatag tggtgtccat cggcgtcaac 300

gaacgcgatg gcggtacgat ctacaacgcg caattactct tcgatgccga cggcagcctg 360

attcagcatc gccgcaaaat caccccgacc tatcacgaac gcatggtctg ggggcaaggc 420

gatggttccg gcctgcgcgc catcgacagt gcagtggggc gcattggctc cctggcctgt 480

tgggagcatt acaacccgct ggctcgttat gccttgatgg ccgatggcga gcagatccac 540

gccgcgatgt ttcccggctc gctggtgggc gacatttttg ccgagcagat cgaagtcacc 600

atccgccatc acgccttgga gtccggctgt ttcgtggtca acgccaccgc ctggctggac 660

gccgatcagc agggccaaat catgcaagac accggttgca gcctcggccc gatctcgggt 720

ggctgcttca ccgccatcgt ttcccctgaa ggcaagttgc tcggtgagcc gctgcgttcc 780

ggcgaagggg tggtgatcgc cgatctcgat ctggcactga tcgataagcg taaacggatg 840

atggattcgg tcgggcatta cagtcgcccg gaactgctca gcctgttgat cgaccgcacg 900

cccacagcgc atgtgcatga acgcagcgcg cacctggtgg ctgtcgctac cgaggagttc 960

gatcatgcaa accaatga 978



<210> 94 
<211> 325 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 94 



Met Pro Ile Ile Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Glu Gly Thr Val Asp Lys Val Cys Gln Gln Ile Ile Asp Leu
20 25 30

Gly Arg Gln Gly Val Gln Phe Ala Val Phe Pro Glu Thr Val Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Pro Ala Phe Ala Met Gly Ala
50 55 60

Gln His Leu Lys Leu Leu Asp Gln Ser Val Thr Val Pro Ser Ala Ala
65 70 75 80

Thr Leu Ala Ile Gly Glu Ala Cys Lys Gln Ala Gly Ile Val Val Ser
85 90 95

Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Ala Gln Leu
100 105 110

Leu Phe Asp Ala Asp Gly Ser Leu Ile Gln His Arg Arg Lys Ile Thr
115 120 125

Pro Thr Tyr His Glu Arg Met Val Trp Gly Gln Gly Asp Gly Ser Gly
130 135 140

Leu Arg Ala Ile Asp Ser Ala Val Gly Arg Ile Gly Ser Leu Ala Cys
145 150 155 160

Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Asp Gly
165 170 175

Glu Gln Ile His Ala Ala Met Phe Pro Gly Ser Leu Val Gly Asp Ile
180 185 190

Phe Ala Glu Gln Ile Glu Val Thr Ile Arg His His Ala Leu Glu Ser
195 200 205

Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp Gln Gln
210 215 220

Gly Gln Ile Met Gln Asp Thr Gly Cys Ser Leu Gly Pro Ile Ser Gly
225 230 235 240

Gly Cys Phe Thr Ala Ile Val Ser Pro Glu Gly Lys Leu Leu Gly Glu
245 250 255

Pro Leu Arg Ser Gly Glu Gly Val Val Ile Ala Asp Leu Asp Leu Ala
260 265 270

Leu Ile Asp Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr Ser
275 280 285

Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Thr Ala His
290 295 300

Val His Glu Arg Ser Ala His Leu Val Ala Val Ala Thr Glu Glu Phe
305 310 315 320

Asp His Ala Asn Gln
325

<210> 95 
<211> 966 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 95 

atgtccaacg agaataacat tgctacattc aaagttgccg cagtccaggc cacacctgtg 60
tttcttgatc gtgaagcaac catcgacaaa gcttgcgcgt tgattgccac tgctggcagt 120
gaaggagcgc gcctgattgt gtttccagaa gcattcatcc caacttatcc tgaatgggta 180
tggggtattc cctccggtga gcaaggttta ctcaacgaac tctatgcaga gttgctcacc 240
aatgcggtca ccattcccag cgatgcgact gacaggctgt gcgaggctgc gcagcttgcg 300
aatgcctacg tagtgatggg catgagcgaa cggaacgtcg aggcgagtgg cgcaagcctg 360
tataatacgc tgttgtacat aaatgcgcag ggggagattt tagggaaaca tcgaaagctg 420
gtgccaacgg gcggcgaacg cctggtatgg gcgcagggtg atggcagtac gctgcaggtc 480
tacgatactc cattgggaaa actcggtggc ttaatttgct gggaaaatta tatgccgctg 540
gcacggtatg ctatgtatgc ctggggaaca caaatctatg tcgcggcaac gtgggatcgc 600
ggtcaaccct ggctttctac attaaggcat atcgccaaag aaggcagggt atacgtgatt 660
ggttgctgta tcgcgatgcg taaagacgat attccagatc gttacaccat gaagcaaaaa 720
tattatgctg aaatggatga atggatgaat gttggtgaca gtgtgattgt caatcccgag 780
gggcacttta ttgccgggcc tgtgcgcaag caggaagaaa ttctctacgc ggagattgat 840
cctcgcatgg tgcaaggccc gaagtggatg ctcgatgtgg cagggcatta tgcgagaccg 900
gatgtgttcc agttgacggt gcatacggat gtgaggcgga tgatgcgggt ggaagatgat 960
tcataa 966

<210> 96
<211> 321
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 96

Met Ser Asn Glu Asn Asn Ile Ala Thr Phe Lys Val Ala Ala Val Gln
1 5 10 15

Ala Thr Pro Val Phe Leu Asp Arg Glu Ala Thr Ile Asp Lys Ala Cys
20 25 30

Ala Leu Ile Ala Thr Ala Gly Ser Glu Gly Ala Arg Leu Ile Val Phe
35 40 45

Pro Glu Ala Phe Ile Pro Thr Tyr Pro Glu Trp Val Trp Gly Ile Pro
50 55 60

Ser Gly Glu Gln Gly Leu Leu Asn Glu Leu Tyr Ala Glu Leu Leu Thr
65 70 75 80

Asn Ala Val Thr Ile Pro Ser Asp Ala Thr Asp Arg Leu Cys Glu Ala
85 90 95

Ala Gln Leu Ala Asn Ala Tyr Val Val Met Gly Met Ser Glu Arg Asn
100 105 110

Val Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Ile Asn
115 120 125

Ala Gln Gly Glu Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly
130 135 140

Gly Glu Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln Val
145 150 155 160

Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn
165 170 175

Tyr Met Pro Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Thr Gln Ile
180 185 190

Tyr Val Ala Ala Thr Trp Asp Arg Gly Gln Pro Trp Leu Ser Thr Leu
195 200 205

Arg His Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile
210 215 220

Ala Met Arg Lys Asp Asp Ile Pro Asp Arg Tyr Thr Met Lys Gln Lys
225 230 235 240

Tyr Tyr Ala Glu Met Asp Glu Trp Met Asn Val Gly Asp Ser Val Ile
245 250 255

Val Asn Pro Glu Gly His Phe Ile Ala Gly Pro Val Arg Lys Gln Glu
260 265 270

Glu Ile Leu Tyr Ala Glu Ile Asp Pro Arg Met Val Gln Gly Pro Lys
275 280 285

Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gln
290 295 300

Leu Thr Val His Thr Asp Val Arg Arg Met Met Arg Val Glu Asp Asp
305 310 315 320

Ser



<210> 97 
<211> 1017 
<212> DNA 
<213> Unknown 

<220>

<223> Obtained from an environmental sample 



<400> 97

atgggcatcg aacatccgaa atacaaggtc gccgtggtgc aagctgcgcc cgcctggctc 60

gacctcgacg cgtcgatcga caagacgatc gggctgatcg aggaggcggc gaagaaaggc 120

gccaaactaa tcgctttccc cgaagccttc attcccggct acccttggca catctggctc 180

gactcacccg cSgggcgat cggccgcggt ttcgtgcagc gctatttcga caattcgctc 240 

gcctacgaca gcccacaggc ggaaaggctg cgacaggccg tgcggaaggc caagctcacc 300 

gccgtgatcg gcctgtccga gcgcgacggc ggcagcctct atctcgcgca gtggctgatc 360

aaacccgS ^tgagaccat cgcaaagcgc cgcaagctgc ggccgaccca tgccgagcgc 420 

accgtcCtg gcgaaggcga cggcagcgat ctcgccgtcc atgagcgggc cgacatcggc 480 

caactcqqcg cgctgtgctg ctgggagcat ctgcagccgc tgtcgaaatt cgccatgtac 540 

qcccagaacg a^caggLca tgicgcggcc tggccgagct tctcgctcta cgatcccttc 600 

qcqoc^Scgc tgggcgcgga ggtgaacaac gccgcctccc gcatctatgc ggtggaaggc 660 

?cctgc?tcg tgcLgclcc gigcgcgacg gtctcgcagg ccatgatcga cgagctctgc 720 

gSggccgg acaagcacgc gctgctgcat gccggcggcg gcttcgccgc ^tctacggg 780 

cccgacggca gccagatcgg cgacaagctg ccgcccgagc aggagggcct ^tgatcgcc 840 

a^atcaatc taqqcgcgat cggcgtcgcc aagaacgcgg ccgatcccgc cgggcattat yuu 

?^cggcccg acg^acgcg gcLctgctc aacaggaagc cgaacaagcg cgtggagcag 960 

ttcgcgctgc ccgtcgacac ggtcgagccc gtcgacgtcg cggcggcagc aagctga 1017 



<210> 98 
<211> 338 
<212> PRT 
<213> Unknown 



<220>

<223> Obtained from an environmental sample 



<400> 98

Met Gly Ile Glu His Pro Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15

Pro Ala Trp Leu Asp Leu Asp Ala Ser Ile Asp Lys Thr Ile Gly Leu
20 25 30

Ile Glu Glu Ala Ala Lys Lys Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45

Ala Phe Ile Pro Gly Tyr Pro Trp His Ile Trp Leu Asp Ser Pro Ala
50 55 60

Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80

Ala Tyr Asp Ser Pro Gln Ala Glu Arg Leu Arg Gln Ala Val Arg Lys
85 90 95

Ala Lys Leu Thr Ala Val Ile Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110

Leu Tyr Leu Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125

Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly
130 135 140

Glu Gly Asp Gly Ser Asp Leu Ala Val His Glu Arg Ala Asp Ile Gly
145 150 155 160

Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175

Phe Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
180 185 190

Ser Phe Ser Leu Tyr Asp Pro Phe Ala Pro Ala Leu Gly Ala Glu Val
195 200 205

Asn Asn Ala Ala Ser Arg Ile Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220

Leu Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225 230 235 240

Asp Arg Pro Asp Lys His Ala Leu Leu His Ala Gly Gly Gly Phe Ala
245 250 255

Ala Ile Tyr Gly Pro Asp Gly Ser Gln Ile Gly Asp Lys Leu Pro Pro
260 265 270

Glu Gln Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Ala Ile Gly
275 280 285

Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300

Val Thr Arg Leu Leu Leu Asn Arg Lys Pro Asn Lys Arg Val Glu Gln
305 310 315 320

Phe Ala Leu Pro Val Asp Thr Val Glu Pro Val Asp Val Ala Ala Ala
325 330 335

Ala Ser



<210> 99 
<211> 1014 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 99

atgcctgaca agagaatcgt ccgcgccgcc gcggtccaga tagcaccgga cctcgaacgg 60
cccggtggca cgctcgagaa ggtcctcgag acgatcgacg acgccgcacg ccagggcgtg 120
cagctcatcg tcttccccga gaccttcctg ccctactacc cgtacttttc gttcgtgcgg 180
gcgccggtgg catcgggtgc agagcacatg cggctctatg acgaagcggt ggtcgtgccc 240
gggccggtga cgcatgcggt ggccgagcgg gcacggcggc acggcatggt cgtcgtgctc 300
ggcgtgaacg agcgcgatca cggcagctta tacaacgcac aactgatctt cgataccgac 360
ggcgagctgc tgctcaagcg ccgcaagatc acgccgacgt ttcacgaacg gatgatctgg 420
ggcatgggcg acgcagccgg cctgaaggta gcggaaacgc gtatcggccg ggtgggtgca 480
ctcgcttgct gggaacacta caacccgctt gcacgttatg cactgatgac ccagcacgaa 540
gagattcatt gcagccagtt tcccggctcg ctggtcggac ccatcttcgg tgaacagatc 600
gaagtgacca tccggcatca cgcactggaa tccggctgct tcgtgatcaa ttccaccggc 660
tggctgaccg agccgcagat cgagtcgatc acgaaagatc cgggcctgca gaaggcgctt 720
cgcggcggct gcaacacggc gatcatctcg cccgaaggcc agcatctcgc cccgccgctg 780
cgtgagggcg agggcatggt catcgctgac ctggacatgt cgctgatcac caaacgcaaa 840
cgcatgatgg attctgtcgg ccactacgcg cggcccgaac tgctgagcct cgccatcaac 900
gaccggccgg cggtcacgtc ggcacccatg aacagcttct catcttcaac cgggggattg 960
caccttgaac gcgaacgaga ccttgtcggc cgtgagccgg caattgatga ctga 1014

<210> 100 
<211> 337 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 100 

Met Pro Asp Lys Arg Ile Val Arg Ala Ala Ala Val Gln Ile Ala Pro
1 5 10 15

Asp Leu Glu Arg Pro Gly Gly Thr Leu Glu Lys Val Leu Glu Thr Ile
20 25 30

Asp Asp Ala Ala Arg Gln Gly Val Gln Leu Ile Val Phe Pro Glu Thr
35 40 45

Phe Leu Pro Tyr Tyr Pro Tyr Phe Ser Phe Val Arg Ala Pro Val Ala
50 55 60

Ser Gly Ala Glu His Met Arg Leu Tyr Asp Glu Ala Val Val Val Pro
65 70 75 80

Gly Pro Val Thr His Ala Val Ala Glu Arg Ala Arg Arg His Gly Met
85 90 95

Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Ser Leu Tyr Asn
100 105 110

Ala Gln Leu Ile Phe Asp Thr Asp Gly Glu Leu Leu Leu Lys Arg Arg
115 120 125

Lys Ile Thr Pro Thr Phe His Glu Arg Met Ile Trp Gly Met Gly Asp
130 135 140

Ala Ala Gly Leu Lys Val Ala Glu Thr Arg Ile Gly Arg Val Gly Ala
145 150 155 160

Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met
165 170 175

Thr Gln His Glu Glu Ile His Cys Ser Gln Phe Pro Gly Ser Leu Val
180 185 190

Gly Pro Ile Phe Gly Glu Gln Ile Glu Val Thr Ile Arg His His Ala
195 200 205

Leu Glu Ser Gly Cys Phe Val Ile Asn Ser Thr Gly Trp Leu Thr Glu
210 215 220

Pro Gln Ile Glu Ser Ile Thr Lys Asp Pro Gly Leu Gln Lys Ala Leu
225 230 235 240

Arg Gly Gly Cys Asn Thr Ala Ile Ile Ser Pro Glu Gly Gln His Leu
245 250 255

Ala Pro Pro Leu Arg Glu Gly Glu Gly Met Val Ile Ala Asp Leu Asp
260 265 270

Met Ser Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285

Tyr Ala Arg Pro Glu Leu Leu Ser Leu Ala Ile Asn Asp Arg Pro Ala
290 295 300

Val Thr Ser Ala Pro Met Asn Ser Phe Ser Ser Ser Thr Gly Gly Leu
305 310 315 320

His Leu Glu Arg Glu Arg Asp Leu Val Gly Arg Glu Pro Ala Ile Asp
325 330 335

Asp

<210> 101
<211> 1065 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 101 

atggcgacag tccatccgaa atttaaagta gccgccgtcc aggcggcccc ggcctttctc 60
gacctcgacg cgtcggtgga aaaagcggtg cgcctgattg atgaagccgg cgccgctggt 120
gcccggctca tcgcgtttcc agagactttt atccccggtt atccgtggtg gatctggctc 180
ggtgctccgg cctgggcgat catgcgcggc ttcgtctccc gctatttcga caactcgctg 240
cagtacggca ccccggaagc cgaccggctg cgggcagccg ccaaacgcaa caaaatgttc 300
gtcgcgctcg gactgtcaga gcgcgacggc ggcagtctct acatcgccca atggattatc 360
ggacccgacg gcgagacggt cgcaacgcgc cgcaagctca agcctactca cgccgagcgg 420
acggtgttcg gcgaaggcga tggctcgcac cttgcggtcc acgaacttga tatcgggcgg 480
gtcggtgcgc tgtgctgttg ggagcacctg cagccactgt cgaagtacgc gatgtatgcg 540
cagaacgagc aagttcatat cgcggcgtgg ccgagctttt cgctttacga tccgttcgcg 600
catgcgcttg gcgccgaggt caacaacgcg gcgagcaaga tctacgcggt cgaaggctca 660
tgctttgtga ttgcgccatg cgcgaccgtt tcccaggcga tgatcgacga attgtgtgac 720
tcgcccgaga agcatcagtt cctgcacgtc ggcggcggtt tcgccgtgat ctatggtccc 780
gacggcgcgc cactcgccaa gccactggcg cccgatcagg agggtctcct ttacgcggat 840
atcgacctcg gcatgatttc ggtcgcgaaa gcggcggccg atccggctgg acattacgcg 900
cgcccggacg tgacccgtct gttgttcaac aatcgtcctg ggaaccgggt ggagacactc 960
gcgctgccgg tcgaccagga ggcagaggcg ggagcaggcg gcaaacctgc gcccaagtca 1020
ccgagtgtcg ctgcgttcac actgacgcag gcggcagccg agtag 1065



<210> 102 
<211> 354 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample

<400> 102

Met Ala Thr Val His Pro Lys Phe Lys Val Ala Ala Val Gln Ala Ala
1 5 10 15

Pro Ala Phe Leu Asp Leu Asp Ala Ser Val Glu Lys Ala Val Arg Leu
20 25 30

Ile Asp Glu Ala Gly Ala Ala Gly Ala Arg Leu Ile Ala Phe Pro Glu
35 40 45

Thr Phe Ile Pro Gly Tyr Pro Trp Trp Ile Trp Leu Gly Ala Pro Ala
50 55 60

Trp Ala Ile Met Arg Gly Phe Val Ser Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80

Gln Tyr Gly Thr Pro Glu Ala Asp Arg Leu Arg Ala Ala Ala Lys Arg
85 90 95

Asn Lys Met Phe Val Ala Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110

Leu Tyr Ile Ala Gln Trp Ile Ile Gly Pro Asp Gly Glu Thr Val Ala
115 120 125

Thr Arg Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Val Phe Gly
130 135 140

Glu Gly Asp Gly Ser His Leu Ala Val His Glu Leu Asp Ile Gly Arg
145 150 155 160

Val Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr
165 170 175

Ala Met Tyr Ala Gln Asn Glu Gln Val His Ile Ala Ala Trp Pro Ser
180 185 190

Phe Ser Leu Tyr Asp Pro Phe Ala His Ala Leu Gly Ala Glu Val Asn
195 200 205

Asn Ala Ala Ser Lys Ile Tyr Ala Val Glu Gly Ser Cys Phe Val Ile
210 215 220

Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys Asp
225 230 235 240

Ser Pro Glu Lys His Gln Phe Leu His Val Gly Gly Gly Phe Ala Val
245 250 255

Ile Tyr Gly Pro Asp Gly Ala Pro Leu Ala Lys Pro Leu Ala Pro Asp
260 265 270

Gln Glu Gly Leu Leu Tyr Ala Asp Ile Asp Leu Gly Met Ile Ser Val
275 280 285

Ala Lys Ala Ala Ala Asp Pro Ala Gly His Tyr Ala Arg Pro Asp Val
290 295 300

Thr Arg Leu Leu Phe Asn Asn Arg Pro Gly Asn Arg Val Glu Thr Leu
305 310 315 320

Ala Leu Pro Val Asp Gln Glu Ala Glu Ala Gly Ala Gly Gly Lys Pro
325 330 335

Ala Pro Lys Ser Pro Ser Val Ala Ala Phe Thr Leu Thr Gln Ala Ala
340 345 350

Ala Glu



<210> 103 
<211> 945 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 103 

atgggcgagt tcggcgaggt gacgctgggg gtggcgcagg cggcgccggt gtacttcgac 60
cgggaggcgt cgacggagaa ggctcgcggc ctgatccggg aggcggggga gaagggcgtc 120
gacctgttgg cgttcgggga gacgtggctg acggggtacc cgtactggaa ggatgcgccg 180
tggtctcggg agtacaacga cctgcgcgcg cggtacgtgg cgaatggcgt gatgataccg 240
gggccggaga cggacgcgct atgccaggca gcggcggaag cgggggtgga cgtggcaatc 300
ggcgtggtgg agctggagcc ggggagcctt tcgagcgtgt attgcacgtt gctgttcatc 360
tcgcgcgagg gcgagatcct ggggcggcac cggaagctga agccgacgga ttcggaacgg 420
cggtactggt cagagggtga tgcgacgggg ctgcgggtgt acgagcggcc atatggccgg 480
ttgagcggat tgaactgctg ggaacacctt atgatgttgc cggggtacgc gctggcggca 540
caggggacgc agtttcatgt ggcagcgtgg ccgaacatgg cgagctcggc gagcgagctg 600
ctgtcgcggg cgtatgcgta ccaggccgga tgctacgtgt tgtgcgcggg cgggctcggg 660
cctgcgccgg gagagctacc ggacggcatc gcggcggagt cgctggacca cctgacgggc 720
gagagctgca tcatcgaccc gtggggaaaa gtgatcgcgg ggccggtgtc gtgcgaggag 780
acgctgatta cggcgcgggt atcgaccgcg tcaatctacc ggcgcaagtc gctgacggac 840
gtgggtggcc actactcgcg accggacgtg ttccggttcg aggtggatag gtcggagcgc 900
ccgcgagtgg tgtttcggga tggggatgtg gacgaccggg ggtaa 945



<210> 104 
<211> 314 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 104 

Met Gly Glu Phe Gly Glu Val Thr Leu Gly Val Ala Gln Ala Ala Pro
1 5 10 15

Val Tyr Phe Asp Arg Glu Ala Ser Thr Glu Lys Ala Arg Gly Leu Ile
20 25 30

Arg Glu Ala Gly Glu Lys Gly Val Asp Leu Leu Ala Phe Gly Glu Thr
35 40 45

Trp Leu Thr Gly Tyr Pro Tyr Trp Lys Asp Ala Pro Trp Ser Arg Glu
50 55 60

Tyr Asn Asp Leu Arg Ala Arg Tyr Val Ala Asn Gly Val Met Ile Pro
65 70 75 80

Gly Pro Glu Thr Asp Ala Leu Cys Gln Ala Ala Ala Glu Ala Gly Val
85 90 95

Asp Val Ala Ile Gly Val Val Glu Leu Glu Pro Gly Ser Leu Ser Ser
100 105 110

Val Tyr Cys Thr Leu Leu Phe Ile Ser Arg Glu Gly Glu Ile Leu Gly
115 120 125

Arg His Arg Lys Leu Lys Pro Thr Asp Ser Glu Arg Arg Tyr Trp Ser
130 135 140

Glu Gly Asp Ala Thr Gly Leu Arg Val Tyr Glu Arg Pro Tyr Gly Arg
145 150 155 160

Leu Ser Gly Leu Asn Cys Trp Glu His Leu Met Met Leu Pro Gly Tyr
165 170 175

Ala Leu Ala Ala Gln Gly Thr Gln Phe His Val Ala Ala Trp Pro Asn
180 185 190

Met Ala Ser Ser Ala Ser Glu Leu Leu Ser Arg Ala Tyr Ala Tyr Gln
195 200 205

Ala Gly Cys Tyr Val Leu Cys Ala Gly Gly Leu Gly Pro Ala Pro Gly
210 215 220

Glu Leu Pro Asp Gly Ile Ala Ala Glu Ser Leu Asp His Leu Thr Gly
225 230 235 240

Glu Ser Cys Ile Ile Asp Pro Trp Gly Lys Val Ile Ala Gly Pro Val
245 250 255

Ser Cys Glu Glu Thr Leu Ile Thr Ala Arg Val Ser Thr Ala Ser Ile
260 265 270

Tyr Arg Arg Lys Ser Leu Thr Asp Val Gly Gly His Tyr Ser Arg Pro
275 280 285

Asp Val Phe Arg Phe Glu Val Asp Arg Ser Glu Arg Pro Arg Val Val
290 295 300

Phe Arg Asp Gly Asp Val Asp Asp Arg Gly
305 310

<210> 105 
<211> 975 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

aJgaccattg tcaaagccgc tgccgtccag attgcccccg ttctctacag ^tgaaggc J° 

ac?gtagaaa aggtcgttaa caagattcgc gaactcggcg agaagggcgt gcagttcgcc 120 

c?t?tccctg aaaccgtcgt accgtactac ccgtactttt cctttgtgca gagccctttc 180 

aaaatgggt? ccgagcacla caaattgctc gaccaggccg ttgtcgtgcc Stcggcgacc 240 

accgatgcca tcggcaaagc ggccaaggaa gccaacatgg tggtgtccat cggcgtcaac 300 

gaacgcgatg gcagcaccct ctacaacacg cagttgctgt ttgatgccga cggcactttg 360 

attcaggccc gtcgcaagat ttcaccgacc taccacgaac ^atgatctg JTO«tgggo 420 

gacggttccg gcctgcgcgc caccgacagc gcggtcgggc gcatcggaca attggcctgc 4b u 

?gggaacat? acaaLcgct ggcgcgttac gccttgatcg aagacggcga aca a 540 
gcctcgatgt accogggctc gttcgcaggt cctttattca ctcgccagat S^gtcagc 600
atccgcatgc atgccctgga atcggcgtgc ttcgtggtca actcgaccgc gtggttgtac 660
ccggaacagc aagcccagat catggccgac accggttgcg agatcgggcc gatctccggc 720
ggctgctaca ccgcgatcat cgacccacag ggtgaagtcg tcggcgcact gaccgaaggc 780

gagggcgaag tgattgccga catcgatctg ttccagatcg aaatccgtaa acgtcagatg 840

gacggccgtg gtcactacag ccgtccggaa atcctgagcc tgaacatcga ccgtacgccg 900 

catcgccatg ttcacgaacg caacgaccag cagaaaccgg gtgtgatcga cactgctgaa 960 

gaaaccgggc gttga 975 

<210> 106 
<211> 324 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample
<400> 106 

Met Thr Ile Val Lys Ala Ala Ala Val Gln Ile Ala Pro Val Leu Tyr
1 5 10 15

Ser Arg Glu Gly Thr Val Glu Lys Val Val Asn Lys Ile Arg Glu Leu
20 25 30

Gly Glu Lys Gly Val Gln Phe Ala Val Phe Pro Glu Thr Val Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Ser Pro Phe Lys Met Gly Ser
50 55 60

Glu His Tyr Lys Leu Leu Asp Gln Ala Val Val Val Pro Ser Ala Thr
65 70 75 80

Thr Asp Ala Ile Gly Lys Ala Ala Lys Glu Ala Asn Met Val Val Ser
85 90 95

Ile Gly Val Asn Glu Arg Asp Gly Ser Thr Leu Tyr Asn Thr Gln Leu
100 105 110

Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Ala Arg Arg Lys Ile Ser
115 120 125

Pro Thr Tyr His Glu Arg Met Ile Trp Gly Met Gly Asp Gly Ser Gly
130 135 140

Leu Arg Ala Thr Asp Ser Ala Val Gly Arg Ile Gly Gln Leu Ala Cys
145 150 155 160

Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Ile Glu Asp Gly
165 170 175

Glu Gln Ile His Ala Ser Met Tyr Pro Gly Ser Phe Ala Gly Pro Leu
180 185 190

Phe Thr Arg Gln Met Glu Val Ser Ile Arg Met His Ala Leu Glu Ser
195 200 205

Ala Cys Phe Val Val Asn Ser Thr Ala Trp Leu Tyr Pro Glu Gln Gln
210 215 220

Ala Gln Ile Met Ala Asp Thr Gly Cys Glu Ile Gly Pro Ile Ser Gly
225 230 235 240

Gly Cys Tyr Thr Ala Ile Ile Asp Pro Gln Gly Glu Val Val Gly Ala
245 250 255

Leu Thr Glu Gly Glu Gly Glu Val Ile Ala Asp Ile Asp Leu Phe Gln
260 265 270

Ile Glu Ile Arg Lys Arg Gln Met Asp Gly Arg Gly His Tyr Ser Arg
275 280 285

Pro Glu Ile Leu Ser Leu Asn Ile Asp Arg Thr Pro His Arg His Val
290 295 300

His Glu Arg Asn Asp Gln Gln Lys Pro Gly Val Ile Asp Thr Ala Glu
305 310 315 320

Glu Thr Gly Arg



<210> 107 
<211> 981 
<212> DNA 
<213> Unknown 
<220>

<223> Obtained from an environmental sample 



<400> 107 

atggccatca ttcgcgcagc agccgtacag atcagcccgg ttctttacag ccgcgaaggc 60
accgtggaca aggtctgcca gcagatcatc acccttggca aacagggtgt gcagttcgcc 120
gtgttcccgg aaacggtggt gccgtactac ccctattttt cctttgtgca gccggcgttc 180
gccatgggtg cgcaacacct caaattgcta gatcaatctg taaccgtgcc atcggccgcc 240
accctggcga ttggcgaagc gtgcaagcaa gcaggaatgg tcgtttccat cggagtcaat 300
gaacgcgatg gcggtacgat ttacaacgcg caattactct tcgatgctga cggcacgctg 360
attcagcatc ggcgcaaaat caccccgacc taccacgagc gcatggtctg ggggcagggc 420
gatggttccg gtctgcgcgc catcgacagc gcggtcgggc gcatcggctc cctggcatgc 480
tgggaacatt acaacccgct ggcccgttac gccttgatgg cagacggcga acagatccac 540
gccgcgatgt ttcccggttc cctggtgggt gacatcttcg ccgagcagat cgaggtcacc 600
atccgccatc acgcattgga gtcaggatgc ttcgtggtca atgcaacagc ctggctggat 660
gcggatcagc agggccaaat aatgcaggac acaggttgcg gccttggtcc catctcgggc 720
ggctgcttca ccgcgatcgt atcgccggaa gggaagctac ttggagagcc gcttcgctcc 780
ggggaaggcg tagtgattgc cgacctcgat acggccttga tcgacaagcg caaacggatg 840
atggattcag taggtcatta cagtcgtccc gagctgctca gcctattgat cgatcgatcg 900
ccgactgcgc atgttcatga acgcgccggc tttgtttcga gcaacgccgg tttgcaggag 960
gtcgcccatg cagaccaatg a 981

<210> 108
<211> 326
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 108

Met Ala Ile Ile Arg Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Glu Gly Thr Val Asp Lys Val Cys Gln Gln Ile Ile Thr Leu
20 25 30

Gly Lys Gln Gly Val Gln Phe Ala Val Phe Pro Glu Thr Val Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Pro Ala Phe Ala Met Gly Ala
50 55 60

Gln His Leu Lys Leu Leu Asp Gln Ser Val Thr Val Pro Ser Ala Ala
65 70 75 80

Thr Leu Ala Ile Gly Glu Ala Cys Lys Gln Ala Gly Met Val Val Ser
85 90 95

Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Ala Gln Leu
100 105 110

Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln His Arg Arg Lys Ile Thr
115 120 125

Pro Thr Tyr His Glu Arg Met Val Trp Gly Gln Gly Asp Gly Ser Gly
130 135 140

Leu Arg Ala Ile Asp Ser Ala Val Gly Arg Ile Gly Ser Leu Ala Cys
145 150 155 160

Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Asp Gly
165 170 175

Glu Gln Ile His Ala Ala Met Phe Pro Gly Ser Leu Val Gly Asp Ile
180 185 190

Phe Ala Glu Gln Ile Glu Val Thr Ile Arg His His Ala Leu Glu Ser
195 200 205

Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp Gln Gln
210 215 220

Gly Gln Ile Met Gln Asp Thr Gly Cys Gly Leu Gly Pro Ile Ser Gly
225 230 235 240

Gly Cys Phe Thr Ala Ile Val Ser Pro Glu Gly Lys Leu Leu Gly Glu
245 250 255

Pro Leu Arg Ser Gly Glu Gly Val Val Ile Ala Asp Leu Asp Thr Ala
260 265 270

Leu Ile Asp Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr Ser
275 280 285

Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Ser Pro Thr Ala His
290 295 300

Val His Glu Arg Ala Gly Phe Val Ser Ser Asn Ala Gly Leu Gln Glu
305 310 315 320

Val Ala His Ala Asp Gln
325























<210> 109 
<211> 1092 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 109 

atggccatca ttcgcgcagc agccgtacag atcagcccgg ttctttacag ccgcgaaggc 60
accgtggaca gggtctgcca gcagatcatc acccttggca aacaaggtgt gcagttcgcc 120
gtgttcccgg aaacggtggt gccgtactac ccctattttt cctttgtgca gccggcattt 180
gcgatgggtg cacaacacct caaattgctc gatcaatctg taaccgtgcc atcggccgcc 240
accctggcga ttggcgaagc gtgcaagcaa gcaggaatgg tcgtttccat cggcgtcaat 300
gaacgcgatg gcggtacgat ttacaacgcg caattactct tcgatgctga cggcactctg 360
attcagcatc ggcgcaaaat caccccgacc taccacgagc gcatggtctg ggggcagggc 420
gatggttccg gtctgcgcgc catcgacagc gcggtcgggc gcatcggctc cctggcatgc 480
tgggaacatt acaacccgct ggcccgttac gccttgatgg cagacggcga acagatccac 540
gccgcgatgt ttcccggttc cctggtgggt gacatcttcg ccgagcagat cgaggtcacc 600
atccgccatc acgcattgga atcaggatgc ttcgtggtca atgcaacagc ttggctggat 660
gcggatcagc agggccaaat aatgcaggac acaggttgcg gccttggtcc catctcgggc 720
ggctgcttca ccgcgatcgt atcgccggaa gggaagctac ttggagagcc gcttcgctca 780
ggggaaggcg tagtgattgc cgacctcgat atggccttga tcgacaagcg caaacggatg 840
atggattcag taggtcatta cagtcgtccc gagctgctca gcctattgat cgatcgatcg 900
ccgactgcgc attttcatga acgcgccggg ctttgttccg agcgacgccg gtttgcagga 960
ggtcgcgcat gcagaccaat gaattgctcg ctgacctgca aatccaaggc ctgcgttggc 1020
cggccgcgca aatggcttgt cgcgccaagg cggcgccggt ccttcagacc acaaggcgct 1080
gagcctaggt aa 1092



<210> 110 
<211> 363 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
<400> 110 

Met Ala lie lie Arg Ala Ala Ala Val Gin He Ser Pro Val Leu Tyr 

1 5 10 15 

Ser Arg Glu Gly Thr Val Asp Arg Val Cys Gin Gin He He Thr Leu 

20 25 30 

Gly Lys Gin Gly Val Gin Phe Ala Val Phe Pro Glu Thr Val Val Pro 

35 40 45 

Tyr Tyr Pro Tyr Phe Ser Phe Val Gin Pro Ala Phe Ala Met Gly Ala 

50 55 60 

Gin His Leu Lys Leu Leu Asp Gin Ser Val Thr Val Pro Ser Ala Ala 



Page 72 



WO 03/097810 



PCT/US03/15712 



65 70 75 80 



Thr 


Leu 


Ala 


He 


Gly 


Glu 


Ala 


Cys 


Lys 


Gin 


Ala 


Gly 


Met 


Val 


Val 


Ser 










85 










90 










95 




lie 


Gly 


Val 


Asn 


Glu 


Arg 


Asp 


Glv 


Gly 


Thr 


He 


Tyr 


Asn 


Ala 


Gin 


Leu 








100 










105 










110 






LeU 


Phe 


Asp 


Ala 


Asp 


Gly 


Thr 


Leu 


He 


Gin 


His 


Arg 


Arg 


Lvs 


He 


Thr 






115 










120 










125 








Prn 

s: i. \J 


X Hi. 


iyr 


His 


Glu 


Arg 


Met 


Val 




Gly 


Gin 


Gly Asp 


Gly 


Ser 


Gly 












135 










140 










Leu 


A YTT 

y 


n JL ct 


He 


Asp 


Ser 


Ala 


Val 


Gly 


Arg 


He 


Gly 


Ser 


Leu 


Ala 


Cys 


X *± •»> 








150 










155 










160 


irp 


vjl U. 


T-H CI 


Tvt* 
j.yx 


Asn 


Pro 


Leu 


Ala 


Arg 


Tvr 


Ala 


Leu 


Met 


Ala 


Asp 


Gly 








165 










170 










175 




oJLU 


oxn 


xx.e 


nio 


rtX a. 


Ala 


Met 


Phe 


Pro 


Glv 


Ser 


Leu 


Val 


Glv 


Asp 


He 








180 










185 










190 








r\J.ct 


m n 
ox u. 


Gin 


He 


Glu 


Val 


Thr 


He 


Arg 


His 


His 


Ala 


Leu 


Glu 


Ser 






195 










200 










205 












Phe 


Val 


Val 


Asn 


Ala 


Thr 


Ala 


Trr> 


Leu 


Asp 


Ala 


Asp 


Gin 


Gin 


210 










215 










220 












Vj XH 


He 


Met 


Gin 


Asp 


Thr 


Glv 


Cys 


Glv 


Leu 


Glv 


Pro 


He 


Ser 


Gly 


/L £. *J 










230 










235 










240 


Gly Cys Phe Thr Ala Ile Val Ser Pro Glu Gly Lys Leu Leu Gly Glu
245 250 255

Pro Leu Arg Ser Gly Glu Gly Val Val Ile Ala Asp Leu Asp Met Ala
260 265 270

Leu Ile Asp Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr Ser
275 280 285

Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Ser Pro Thr Ala His
290 295 300

Phe His Glu Arg Ala Gly Leu Cys Ser Glu Arg Arg Arg Phe Ala Gly
305 310 315 320

Gly Arg Ala Cys Arg Pro Met Asn Cys Ser Leu Thr Cys Lys Ser Lys
325 330 335

Ala Cys Val Gly Arg Pro Arg Lys Trp Leu Val Ala Pro Arg Arg Arg
340 345 350

Arg Ser Phe Arg Pro Gln Gly Ala Glu Pro Arg
355 360



















<210> 111 
<211> 990 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 111 

atgcccaaaa cagtacgtgc cgcagcagtc cagatcgcgc ccgacctgac gtcacgcgcc 60
ggcaccgtcg agcgggtcct caatgcaatc gccgaagctg ctgacaaagg cgccgagctg 120
atcgtatttc ccgagacctt cgtgccctgg tatccctatt tcagtttcgt tctgccacct 180
gtccagcaag gccctgagca tcttcgtctt tatgaggaag cagtcacggt accatcagca 240
gaaacacggg ccgtcgcgga cgccgcgcgc aaacgcaatg cggttatcgt ccttggcgtc 300
aatgagcgcg accacggctc gctctataac actcagctga tcttcgacgc ggatggcagc 360
ctgaaactca agcgtcgcaa gatcacgccg acctatcacg aacggatgat ctggggccaa 420
ggcgatggcg ccggcctgaa ggttgtcgac actgccgtcg gtcgcgtggg tgccctggca 480
tgctgggagc attacaatcc tctggcccgc tatactttga tggcccagca tgaggaaatt 540
cacgcctctc atttcccggg ctcactggtc ggcccgatat tcggcgagca aatcgaagtc 600
accatgcgcc accacgcgtt ggaatcgggc tgtttcgtgg tcaatgccac cggctggctg 660
agcgaggagc agatcgcatc tattcatccg gaccccgcct tgcaaaaggg cctgcgcgat 720
ggctgcatga cctgcatcat cacgccggaa ggacgccatg tcgtaccgcc gctgacctcg 780
ggcgaaggca tcctgatcgg cgatctggac atgcggctca ttaccaagcg caagcggatg 840
atggattcgg tcggacacta tgctcggcct gaactgctgc accttgtcca tgacacgacg 900
cccgcacgcg cacgcgagca ggtcggcctt tcaggcgatt ttcccgatgc ggagcaagac 960
aagctatttg aggaggttca taatgcgtga 990



<210> 112 
<211> 329 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 112 

Met Pro Lys Thr Val Arg Ala Ala Ala Val Gln Ile Ala Pro Asp Leu
1 5 10 15

Thr Ser Arg Ala Gly Thr Val Glu Arg Val Leu Asn Ala Ile Ala Glu
20 25 30

Ala Ala Asp Lys Gly Ala Glu Leu Ile Val Phe Pro Glu Thr Phe Val
35 40 45

Pro Trp Tyr Pro Tyr Phe Ser Phe Val Leu Pro Pro Val Gln Gln Gly
50 55 60

Pro Glu His Leu Arg Leu Tyr Glu Glu Ala Val Thr Val Pro Ser Ala
65 70 75 80

Glu Thr Arg Ala Val Ala Asp Ala Ala Arg Lys Arg Asn Ala Val Ile
85 90 95

Val Leu Gly Val Asn Glu Arg Asp His Gly Ser Leu Tyr Asn Thr Gln
100 105 110

Leu Ile Phe Asp Ala Asp Gly Ser Leu Lys Leu Lys Arg Arg Lys Ile
115 120 125

Thr Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp Gly Ala
130 135 140

Gly Leu Lys Val Val Asp Thr Ala Val Gly Arg Val Gly Ala Leu Ala
145 150 155 160

Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Thr Leu Met Ala Gln
165 170 175

His Glu Glu Ile His Ala Ser His Phe Pro Gly Ser Leu Val Gly Pro
180 185 190

Ile Phe Gly Glu Gln Ile Glu Val Thr Met Arg His His Ala Leu Glu
195 200 205

Ser Gly Cys Phe Val Val Asn Ala Thr Gly Trp Leu Ser Glu Glu Gln
210 215 220

Ile Ala Ser Ile His Pro Asp Pro Ala Leu Gln Lys Gly Leu Arg Asp
225 230 235 240

Gly Cys Met Thr Cys Ile Ile Thr Pro Glu Gly Arg His Val Val Pro
245 250 255

Pro Leu Thr Ser Gly Glu Gly Ile Leu Ile Gly Asp Leu Asp Met Arg
260 265 270

Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr Ala
275 280 285

Arg Pro Glu Leu Leu His Leu Val His Asp Thr Thr Pro Ala Arg Ala
290 295 300

Arg Glu Gln Val Gly Leu Ser Gly Asp Phe Pro Asp Ala Glu Gln Asp
305 310 315 320

Lys Leu Phe Glu Glu Val His Asn Ala
325



<210> 113 
<211> 993 
<212> DNA 
<213> Unknown 
<220>

<223> Obtained from an environmental sample 



<400> 113 

atgacgaagg aacgcgccgc gcgcagcctg cgcgcagctg ccatacagct tgaagccgaa 60
gtcggcgaca tcgccgccaa tctcgcacgc atcgaggcga tggtcgagga ggctgcgggc 120
aagggcgccg aactgatcgc cattccggag ttctgcacct cccgcatgcc cttcgatgca 180
cgcgtgcacg acgccgtgct gccgccggac aacttcgtgg tcgatgcctt tcgccgcatg 240
gcagcgacgc acaactgccg gctcggcggc tccatgctca ttgccgacgg tggcgagatc 300
tacaaccgct accacttcgt cgaacccgac ggcagcgtgc atctgcacga caaggatctg 360
ccgacgatgt gggagaacgc cttctacacc ggcggctccg acgacggcgt cttcgacacc 420
ggcatcggcg gcgtcggcgc cgcggtgtgc tgggaactgg tacgcaccgg caccgtgcga 480
cgcatgctcg gtcgcgtcga cgtcgccatg accggcacgc attggtggac gatgccgcac 540
aactggggca gcgccgtcgc gcgcacgctg gccgcgatga cgcagtacaa ccgctacatg 600
tccgagaatg cacccaccga attcgcccgc cgcctgggtg tgccggtgct gcaggcctcg 660
cactgcggaa gcttccgcac cggtttcttg ctgctgccag gcagcgggcg tgcactgccc 720
tatgacaccg agtacgtcgg cgccacacag atcgtcgatg ccgatggcca catcctcgcc 780
caccgtcgca cgcaggaagg ccccggtgtc gtcgtcgccg acatcacgct cggtgcccgc 840
acgcccgagc tgccactgga agaccgcttc tggattcccg agctgccgct cttcctcaag 900
gcctactggc accaccagaa cctgtgcggc aagtcctact accgtcgcgt cggccgcgat 960
gccggcctgg cggcggcgga gcgttcggca tga 993



<210> 114 
<211> 330 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample

<400> 114

Met Thr Lys Glu Arg Ala Ala Arg Ser Leu Arg Ala Ala Ala Ile Gln
1 5 10 15

Leu Glu Ala Glu Val Gly Asp Ile Ala Ala Asn Leu Ala Arg Ile Glu
20 25 30

Ala Met Val Glu Glu Ala Ala Gly Lys Gly Ala Glu Leu Ile Ala Ile
35 40 45

Pro Glu Phe Cys Thr Ser Arg Met Pro Phe Asp Ala Arg Val His Asp
50 55 60

Ala Val Leu Pro Pro Asp Asn Phe Val Val Asp Ala Phe Arg Arg Met
65 70 75 80

Ala Ala Thr His Asn Cys Arg Leu Gly Gly Ser Met Leu Ile Ala Asp
85 90 95

Gly Gly Glu Ile Tyr Asn Arg Tyr His Phe Val Glu Pro Asp Gly Ser
100 105 110

Val His Leu His Asp Lys Asp Leu Pro Thr Met Trp Glu Asn Ala Phe
115 120 125

Tyr Thr Gly Gly Ser Asp Asp Gly Val Phe Asp Thr Gly Ile Gly Gly
130 135 140

Val Gly Ala Ala Val Cys Trp Glu Leu Val Arg Thr Gly Thr Val Arg
145 150 155 160

Arg Met Leu Gly Arg Val Asp Val Ala Met Thr Gly Thr His Trp Trp
165 170 175

Thr Met Pro His Asn Trp Gly Ser Ala Val Ala Arg Thr Leu Ala Ala
180 185 190

Met Thr Gln Tyr Asn Arg Tyr Met Ser Glu Asn Ala Pro Thr Glu Phe
195 200 205

Ala Arg Arg Leu Gly Val Pro Val Leu Gln Ala Ser His Cys Gly Ser
210 215 220

Phe Arg Thr Gly Phe Leu Leu Leu Pro Gly Ser Gly Arg Ala Leu Pro
225 230 235 240

Tyr Asp Thr Glu Tyr Val Gly Ala Thr Gln Ile Val Asp Ala Asp Gly
245 250 255

His Ile Leu Ala His Arg Arg Thr Gln Glu Gly Pro Gly Val Val Val
260 265 270

Ala Asp Ile Thr Leu Gly Ala Arg Thr Pro Glu Leu Pro Leu Glu Asp
275 280 285

Arg Phe Trp Ile Pro Glu Leu Pro Leu Phe Leu Lys Ala Tyr Trp His
290 295 300

His Gln Asn Leu Cys Gly Lys Ser Tyr Tyr Arg Arg Val Gly Arg Asp
305 310 315 320

Ala Gly Leu Ala Ala Ala Glu Arg Ser Ala
325 330

<210> 115 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 115 

atgaaccaaa tcattaaagc ggcggcagtt caatgtagcc ctgtgttgta tagccaagcg 60
ggtacagtca agaaaatctg tgacacgatt ttggagttgg ggcagcaagg tgtgcaattt 120
gccgtatttc ctgaaactgt tgtgccttat tacccttatt tttcttttgt gcaaccaccg 180
tttgccatgg gtaaagaaca tttaaagcta ttgcatgaat cggttgtcgt gccatcggca 240
gcaacaactt taattggaca ggcatgcaaa gaagcgaaca tggtggtttc tattggtatt 300
aatgagcgtg caggcggcac gatttataac gctcaattgt tgtttgatgc ggatggttcg 360
attattcagc atcgccgtaa aattacccca acgtatcatg aacgtatggt gtgggggcaa 420
ggcgatggca gtggtttacg tgcgatagat tctgctgtag gacgtattgg gtcgctggca 480
tgttgggagc attacaaccc tttggctcgg tttgctttga tggcggatgg tgagcaaatt 540
catgcggcga tgtttccggg atcactcgtg gggcagattt ttgcagatca gatcagtgcc 600
accattcagc accatgcttt agagtcgggc tgttttgtgg tgaatgccac agcatggctt 660
gacccagagc aacaacaaca aattatgcaa gatacaggct gtgaactcgg tccaatttcg 720
gggggatgtt ttacggccat cgtttctcca gaaggcaaat ttttgtctga accgatcaca 780
caaggcgaag gttatgtgat tgccgattta gacttttcct taatcgaaaa acgtaaacgg 840
atgatggatt ctgttgggca ttatagtcgt ccagaattac tcagtttgtt gattgatcgt 900
cgtcctacct cagttttgca tgagttaaaa ctagagaatc catcgaataa cagcatcgaa 960
aaagtgtctg aatttgccga ggtacacgca tag 993



<210> 116 
<211> 330 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample

<400> 116

Met Asn Gln Ile Ile Lys Ala Ala Ala Val Gln Cys Ser Pro Val Leu
1 5 10 15

Tyr Ser Gln Ala Gly Thr Val Lys Lys Ile Cys Asp Thr Ile Leu Glu
20 25 30

Leu Gly Gln Gln Gly Val Gln Phe Ala Val Phe Pro Glu Thr Val Val
35 40 45

Pro Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Pro Pro Phe Ala Met Gly
50 55 60

Lys Glu His Leu Lys Leu Leu His Glu Ser Val Val Val Pro Ser Ala
65 70 75 80

Ala Thr Thr Leu Ile Gly Gln Ala Cys Lys Glu Ala Asn Met Val Val
85 90 95

Ser Ile Gly Ile Asn Glu Arg Ala Gly Gly Thr Ile Tyr Asn Ala Gln
100 105 110

Leu Leu Phe Asp Ala Asp Gly Ser Ile Ile Gln His Arg Arg Lys Ile
115 120 125

Thr Pro Thr Tyr His Glu Arg Met Val Trp Gly Gln Gly Asp Gly Ser
130 135 140

Gly Leu Arg Ala Ile Asp Ser Ala Val Gly Arg Ile Gly Ser Leu Ala
145 150 155 160

Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Phe Ala Leu Met Ala Asp
165 170 175

Gly Glu Gln Ile His Ala Ala Met Phe Pro Gly Ser Leu Val Gly Gln
180 185 190

Ile Phe Ala Asp Gln Ile Ser Ala Thr Ile Gln His His Ala Leu Glu
195 200 205

Ser Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Pro Glu Gln
210 215 220

Gln Gln Gln Ile Met Gln Asp Thr Gly Cys Glu Leu Gly Pro Ile Ser
225 230 235 240

Gly Gly Cys Phe Thr Ala Ile Val Ser Pro Glu Gly Lys Phe Leu Ser
245 250 255

Glu Pro Ile Thr Gln Gly Glu Gly Tyr Val Ile Ala Asp Leu Asp Phe
260 265 270

Ser Leu Ile Glu Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr
275 280 285

Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Arg Pro Thr Ser
290 295 300

Val Leu His Glu Leu Lys Leu Glu Asn Pro Ser Asn Asn Ser Ile Glu
305 310 315 320

Lys Val Ser Glu Phe Ala Glu Val His Ala
325 330



<210> 117 
<211> 957 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 117 

atgactcaat ccaggataat tcgtgctgcg gcagcgcaga tcgctccgga tttgcaggtt 60
ccaggtaaca cgatcgacaa agtttgccgc accatcagcg aggcggccgc aaaaggcgta 120
cagattattg ttttccctga aaccttggtg ccttattacc cttacttctc ttacatttca 180
ccgcccattc aacagggcaa agaacatttg cggctgtatg accatgcagt ggttgtgccc 240
ggctcggaaa ccgaggcaat ttcagctctt gccgcccaac acaatatggt ggtggttttg 300
ggtgtgaacg agcgcgatca cggcacactt tacaacgcac aaattatttt caacagcgac 360
ggaaagattc tgttgaagcg ccgaaaaatt acaccaactt atcacgagcg gatggtgtgg 420
gggcagggtg acgcttcagg cttgaaggtg gttgattccg cagtgggccg tgtgggtgca 480
ttggcctgtt gggaacacta caaccccttg gctcgctatt gtttgatggc ccagcacgaa 540
gaaattcact gtgcgcagtt tcccggttca ttggtggggc aagtttttgc cgaccaaatg 600
gaagtgacca ttcgtcacca cgcacttgag tcgggctgtt ttgtcatcaa cagcaccgct 660
tggctttctg aagaacaggt tcaaagtatt tcatccgaca gcgcattgca gaaagggctt 720
agaggcggtt gtttcacggc cattgtcagc cctgagggaa agctgttggc tgagccgctc 780
accgagggtg agggcatggt gatcgccgac ctcgacatgg cgttggttac gaaacgcaaa 840
cgcatgatgg attcagtggg ccattatgcg cgccccgagt tgttgagttt gctggttcgg 900
gatgaggctt caagccccat gaaaaaaatt cagggagttc aacatgctga gtactga 957



<210> 118 
<211> 318 
<212> PRT 
<213> Unknown 
<220>

<223> Obtained from an environmental sample

<400> 118

Met Thr Gln Ser Arg Ile Ile Arg Ala Ala Ala Ala Gln Ile Ala Pro
1 5 10 15

Asp Leu Gln Val Pro Gly Asn Thr Ile Asp Lys Val Cys Arg Thr Ile
20 25 30

Ser Glu Ala Ala Ala Lys Gly Val Gln Ile Ile Val Phe Pro Glu Thr
35 40 45

Leu Val Pro Tyr Tyr Pro Tyr Phe Ser Tyr Ile Ser Pro Pro Ile Gln
50 55 60

Gln Gly Lys Glu His Leu Arg Leu Tyr Asp His Ala Val Val Val Pro
65 70 75 80

Gly Ser Glu Thr Glu Ala Ile Ser Ala Leu Ala Ala Gln His Asn Met
85 90 95

Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Thr Leu Tyr Asn
100 105 110

Ala Gln Ile Ile Phe Asn Ser Asp Gly Lys Ile Leu Leu Lys Arg Arg
115 120 125

Lys Ile Thr Pro Thr Tyr His Glu Arg Met Val Trp Gly Gln Gly Asp
130 135 140

Ala Ser Gly Leu Lys Val Val Asp Ser Ala Val Gly Arg Val Gly Ala
145 150 155 160

Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Cys Leu Met
165 170 175

Ala Gln His Glu Glu Ile His Cys Ala Gln Phe Pro Gly Ser Leu Val
180 185 190

Gly Gln Val Phe Ala Asp Gln Met Glu Val Thr Ile Arg His His Ala
195 200 205

Leu Glu Ser Gly Cys Phe Val Ile Asn Ser Thr Ala Trp Leu Ser Glu
210 215 220

Glu Gln Val Gln Ser Ile Ser Ser Asp Ser Ala Leu Gln Lys Gly Leu
225 230 235 240

Arg Gly Gly Cys Phe Thr Ala Ile Val Ser Pro Glu Gly Lys Leu Leu
245 250 255

Ala Glu Pro Leu Thr Glu Gly Glu Gly Met Val Ile Ala Asp Leu Asp
260 265 270

Met Ala Leu Val Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285

Tyr Ala Arg Pro Glu Leu Leu Ser Leu Leu Val Arg Asp Glu Ala Ser
290 295 300

Ser Pro Met Lys Lys Ile Gln Gly Val Gln His Ala Glu Tyr
305 310 315



<210> 119 
<211> 984 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 119

atggatacac tcaaagttgg attggttcag atggccccca tctggttgaa ccgggataaa 60
accctgatca aagttgagga atacatgcag aaagcaggca aacagggctg caacctggta 120
gcttttggtg aagcgctggt tcccggctac cccttctggg tggaacgcac agagggcgcc 180
agattcaatt ccaaagtcca gaaagaactc tttgcacatt accttgatca ggcggtgcag 240
atcgaagccg gccaccttga tcctctccag gcattagccc aacaatacaa gatggctgtg 300
tacgtgggga cgattgaacg cccgcctgag cggagcggcc acagcctgta ctgctcccta 360
atatttatag acccagaagg cgagatcggc tcggttcacc gcaagttgat gcccacccat 420
gaggaacgcc tggtctggtc aactggcgat gggcacggcc tgcgaacaca ttctctgggc 480
gcctttaccg ttggcggact caactgctgg gaaaactgga tgccgctctc ccgcacagct 540
ctttatgcca tgggagagga tcttcatgtt gctgcctggc ccgggagtca gcgcaatact 600
tatgatataa ccaaattcat tgccaaggaa tctcgctctt atgtgatctc cgtatccggg 660
atgatgaaaa aagaaaatat cctctctgaa attccccaca gccaattgat gctggaaaat 720
agcgaggata ttatggctga tggcggatcc tgtctggctg gaccagatgg agaatggatc 780
atcgagccca tcgtcggaga ggaaaccctg gtaactgctg aactatcaca tcagcgggtc 840
agagaagaaa gacagaattt cgacccaaca ggtcactaca gtcggcctga tgtgacccgc 900
ctggtagtcg accgcaggcg ccagcagatc ctggagatca ccccggacga aaaaggaaga 960
tcggatgaaa atcaatccct ttaa 984

<210> 120
<211> 327
<212> PRT
<213> Unknown



<220> 

<223> Obtained from an environmental sample

<400> 120

Met Asp Thr Leu Lys Val Gly Leu Val Gln Met Ala Pro Ile Trp Leu
1 5 10 15

Asn Arg Asp Lys Thr Leu Ile Lys Val Glu Glu Tyr Met Gln Lys Ala
20 25 30

Gly Lys Gln Gly Cys Asn Leu Val Ala Phe Gly Glu Ala Leu Val Pro
35 40 45

Gly Tyr Pro Phe Trp Val Glu Arg Thr Glu Gly Ala Arg Phe Asn Ser
50 55 60

Lys Val Gln Lys Glu Leu Phe Ala His Tyr Leu Asp Gln Ala Val Gln
65 70 75 80

Ile Glu Ala Gly His Leu Asp Pro Leu Gln Ala Leu Ala Gln Gln Tyr
85 90 95

Lys Met Ala Val Tyr Val Gly Thr Ile Glu Arg Pro Pro Glu Arg Ser
100 105 110

Gly His Ser Leu Tyr Cys Ser Leu Ile Phe Ile Asp Pro Glu Gly Glu
115 120 125

Ile Gly Ser Val His Arg Lys Leu Met Pro Thr His Glu Glu Arg Leu
130 135 140

Val Trp Ser Thr Gly Asp Gly His Gly Leu Arg Thr His Ser Leu Gly
145 150 155 160

Ala Phe Thr Val Gly Gly Leu Asn Cys Trp Glu Asn Trp Met Pro Leu
165 170 175

Ser Arg Thr Ala Leu Tyr Ala Met Gly Glu Asp Leu His Val Ala Ala
180 185 190

Trp Pro Gly Ser Gln Arg Asn Thr Tyr Asp Ile Thr Lys Phe Ile Ala
195 200 205

Lys Glu Ser Arg Ser Tyr Val Ile Ser Val Ser Gly Met Met Lys Lys
210 215 220

Glu Asn Ile Leu Ser Glu Ile Pro His Ser Gln Leu Met Leu Glu Asn
225 230 235 240

Ser Glu Asp Ile Met Ala Asp Gly Gly Ser Cys Leu Ala Gly Pro Asp
245 250 255

Gly Glu Trp Ile Ile Glu Pro Ile Val Gly Glu Glu Thr Leu Val Thr
260 265 270

Ala Glu Leu Ser His Gln Arg Val Arg Glu Glu Arg Gln Asn Phe Asp
275 280 285

Pro Thr Gly His Tyr Ser Arg Pro Asp Val Thr Arg Leu Val Val Asp
290 295 300

Arg Arg Arg Gln Gln Ile Leu Glu Ile Thr Pro Asp Glu Lys Gly Arg
305 310 315 320

Ser Asp Glu Asn Gln Ser Leu
325



<210> 121 
<211> 1158 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 121 

atgagcaaaa aagttctagg cggcagagaa aaagtaaaag ttgcagtagt tcaggctgcg 60
cccgttttca tggacaagga gaagacgatt gaaaaggctt gcaagctaat aaaagaagcg 120
gggagaaatc gagccgagct catagcgttc tcagagtcat tcatccccgt ctatcctgca 180
tactataccg tcggctatga aaccccttct caagaatgga gagattacgt gattgcgcta 240
caggataact ccgtgctgat tccgagcgag gataccgagg tactcggaca ggctgcaaag 300
gaggcagggg cttatgcagt aataggatgc agcgagatgg acgaccgtcc gggaagccga 360
acagtttaca acacgctcct cttcatcggc aaagacggca aggtcatggg aaggcataga 420
aaactcaaac ccacgttcac ggagagaata tactggggag agggagatgc tggagacata 480
aaggtttttg ataccgagat cggcaggatc ggaggcctcg tatgctggga gaaccatatg 540
actctagtca gggccgcgat gatacacagg ggagaggagt ttcatatcgc ggtctggccg 600
ggaaactgga agggtgcgga aaacaagctt ctccaagcag ataatagccc aggaggcgcc 660
ctctgcaacc ttcaatctct cattaaagta cacgcctttg aggccggggc gtttgtgctg 720
agcgcttgcg gctttttgac gccagaggat ttcccggaaa ggtggcatta tataagggat 780
ggtaaccata ttaactgcga ctgggcactg ggcggaagct caatcgtcaa tcccgccggc 840
cgttatctcg tcgagcctaa ctttgagaag gatgcaatcc tctatgcgga ttgttatgca 900
aaccagataa aagcagtaaa agcggttttt gattcccttg gccactattc ccgctgggat 960
attgcccaac tggcgataag gcaggaagcc tggaatccag aggtttcttt gatcgattcc 1020
tcttcgactg aagttgagct tccggcagac gagcttcgaa ggatttcgga gaagtttgaa 1080
gtaactgcgg ataagttgga atctttgctt gaggaaattg gaaagattaa aaagcccagg 1140
aaacaagccg gttcctaa 1158



<210> 122 
<211> 385 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 122 



Met Ser Lys Lys Val Leu Gly Gly Arg Glu Lys Val Lys Val Ala Val
1 5 10 15

Val Gln Ala Ala Pro Val Phe Met Asp Lys Glu Lys Thr Ile Glu Lys
20 25 30

Ala Cys Lys Leu Ile Lys Glu Ala Gly Arg Asn Arg Ala Glu Leu Ile
35 40 45

Ala Phe Ser Glu Ser Phe Ile Pro Val Tyr Pro Ala Tyr Tyr Thr Val
50 55 60

Gly Tyr Glu Thr Pro Ser Gln Glu Trp Arg Asp Tyr Val Ile Ala Leu
65 70 75 80

Gln Asp Asn Ser Val Leu Ile Pro Ser Glu Asp Thr Glu Val Leu Gly
85 90 95

Gln Ala Ala Lys Glu Ala Gly Ala Tyr Ala Val Ile Gly Cys Ser Glu
100 105 110

Met Asp Asp Arg Pro Gly Ser Arg Thr Val Tyr Asn Thr Leu Leu Phe
115 120 125

Ile Gly Lys Asp Gly Lys Val Met Gly Arg His Arg Lys Leu Lys Pro
130 135 140

Thr Phe Thr Glu Arg Ile Tyr Trp Gly Glu Gly Asp Ala Gly Asp Ile
145 150 155 160

Lys Val Phe Asp Thr Glu Ile Gly Arg Ile Gly Gly Leu Val Cys Trp
165 170 175

Glu Asn His Met Thr Leu Val Arg Ala Ala Met Ile His Arg Gly Glu
180 185 190

Glu Phe His Ile Ala Val Trp Pro Gly Asn Trp Lys Gly Ala Glu Asn
195 200 205

Lys Leu Leu Gln Ala Asp Asn Ser Pro Gly Gly Ala Leu Cys Asn Leu
210 215 220

Gln Ser Leu Ile Lys Val His Ala Phe Glu Ala Gly Ala Phe Val Leu
225 230 235 240

Ser Ala Cys Gly Phe Leu Thr Pro Glu Asp Phe Pro Glu Arg Trp His
245 250 255

Tyr Ile Arg Asp Gly Asn His Ile Asn Cys Asp Trp Ala Leu Gly Gly
260 265 270

Ser Ser Ile Val Asn Pro Ala Gly Arg Tyr Leu Val Glu Pro Asn Phe
275 280 285

Glu Lys Asp Ala Ile Leu Tyr Ala Asp Cys Tyr Ala Asn Gln Ile Lys
290 295 300

Ala Val Lys Ala Val Phe Asp Ser Leu Gly His Tyr Ser Arg Trp Asp
305 310 315 320

Ile Ala Gln Leu Ala Ile Arg Gln Glu Ala Trp Asn Pro Glu Val Ser
325 330 335

Leu Ile Asp Ser Ser Ser Thr Glu Val Glu Leu Pro Ala Asp Glu Leu
340 345 350

Arg Arg Ile Ser Glu Lys Phe Glu Val Thr Ala Asp Lys Leu Glu Ser
355 360 365

Leu Leu Glu Glu Ile Gly Lys Ile Lys Lys Pro Arg Lys Gln Ala Gly
370 375 380

Ser
385

<210> 123 
<211> 990
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 123 

atgtcaactt tcaagatcgc taccgtgcag agtgcaccag tatttatgga ccgcgaagct 60
accattgaca agacttgcga gctgatcgcc gaagcagcac aagatgacga cgttcgccta 120
gtggtcttcc ccgaggcctt tatccccacc tatccggact gggtatggcg tatccctccc 180
ggacagcacc agatgcttgc cgacctgtac ggggagttgc tcgagcagtc ggtgacgata 240
cccagtctgg ctaccgagcg gctctgtcag gctgcaaaga aagcgggcgt ttatgtagct 300
gtgggcctta acgaacgcaa tacagaggcc agcaacgcta ccctgtacaa caccctgctc 360
tacattgacg ccgagggcaa cttgctaggt aagcaccgaa agctggtacc gaccgctccc 420
gaacgcatgg tctgggcaca gggagatggc agtacccttg aggtctacga gacctccttc 480
ggaaaactca gcggactaat ctgttgggag aactacatgc ctctcgctcg ttatgccctg 540
tatgcctggg gagtacagct ctatttggct cctacttggg atcgaggcga gccctggctt 600
tccactctgc ggcacattgc caaggaagga cgagtatacg tggtcggctg ctctatcgcc 660
ttacgtaagg aagacatccc cgaccgattc gaattcaagg cgaagtacta cgcagaggca 720
ggagagtgga taaacaaagg tgacagcgtc atcgtcggtc ccgatggcga gctcatcgcc 780
gggcctctac ataaggaaca ggggatactc tatgctgagc tggacacaag gcagatgcac 840
gcccccaagt ggaacctgga tgtagccgga cactacgcgc gcccggacgt gtttcggctg 900
accgtgagca aggatggcca tccgatgctc ggcgttgccc aagggcccaa gcatgagccg 960
caagataaga ccgaagtatt agagggctag 990

<210> 124
<211> 329
<212> PRT
<213> Unknown

<220> 

<223> Obtained from an environmental sample 
<400> 124 

Met Ser Thr Phe Lys Ile Ala Thr Val Gln Ser Ala Pro Val Phe Met
1 5 10 15

Asp Arg Glu Ala Thr Ile Asp Lys Thr Cys Glu Leu Ile Ala Glu Ala
20 25 30

Ala Gln Asp Asp Asp Val Arg Leu Val Val Phe Pro Glu Ala Phe Ile
35 40 45

Pro Thr Tyr Pro Asp Trp Val Trp Arg Ile Pro Pro Gly Gln His Gln
50 55 60

Met Leu Ala Asp Leu Tyr Gly Glu Leu Leu Glu Gln Ser Val Thr Ile
65 70 75 80

Pro Ser Leu Ala Thr Glu Arg Leu Cys Gln Ala Ala Lys Lys Ala Gly
85 90 95

Val Tyr Val Ala Val Gly Leu Asn Glu Arg Asn Thr Glu Ala Ser Asn
100 105 110

Ala Thr Leu Tyr Asn Thr Leu Leu Tyr Ile Asp Ala Glu Gly Asn Leu
115 120 125

Leu Gly Lys His Arg Lys Leu Val Pro Thr Ala Pro Glu Arg Met Val
130 135 140

Trp Ala Gln Gly Asp Gly Ser Thr Leu Glu Val Tyr Glu Thr Ser Phe
145 150 155 160

Gly Lys Leu Ser Gly Leu Ile Cys Trp Glu Asn Tyr Met Pro Leu Ala
165 170 175

Arg Tyr Ala Leu Tyr Ala Trp Gly Val Gln Leu Tyr Leu Ala Pro Thr
180 185 190

Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His Ile Ala Lys
195 200 205

Glu Gly Arg Val Tyr Val Val Gly Cys Ser Ile Ala Leu Arg Lys Glu
210 215 220

Asp Ile Pro Asp Arg Phe Glu Phe Lys Ala Lys Tyr Tyr Ala Glu Ala
225 230 235 240

Gly Glu Trp Ile Asn Lys Gly Asp Ser Val Ile Val Gly Pro Asp Gly
245 250 255

Glu Leu Ile Ala Gly Pro Leu His Lys Glu Gln Gly Ile Leu Tyr Ala
260 265 270

Glu Leu Asp Thr Arg Gln Met His Ala Pro Lys Trp Asn Leu Asp Val
275 280 285

Ala Gly His Tyr Ala Arg Pro Asp Val Phe Arg Leu Thr Val Ser Lys
290 295 300

Asp Gly His Pro Met Leu Gly Val Ala Gln Gly Pro Lys His Glu Pro
305 310 315 320

Gln Asp Lys Thr Glu Val Leu Glu Gly
325

<210> 125 
<211> 1050 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 125 

atgacaactg taaaaaagac ggtacgcgca gcagcgatcc agatcgcacc tgacctcgac 60
agtgcaggcg gtacgctgga caaggtttgc acggccattc aaaaggcggc ggcacaaggc 120
gcggagctgg tggtttttcc cgaaaccttc ttgccctact atccttactt ttcattcgtg 180
cggccgccct tcgcatccgg cccggaacac ttgctgctat atgaacgcgc agtggcggtg 240
ccaggcccgg tgaccgatgc cgtctctgcc gtcgcgcgca gccacggcgt ggtggtggta 300
ctcggcgtca atgaacgcga ccatggcacg ctgtacaaca cccaactggt gttcgacgcg 360
aatggcgaac tggtgttgaa acgcagaaaa atcacgccga cttatcacga gcggatgatc 420
tggggtcaag gcgacggcag cggactcaaa gtagtgcaaa cggcggtcgg ccggctaggc 480
gcgctagcct gttgggaaca ctacaaccca ctggcccgtt atgcattgat ggcgcaacac 540
gaagaaatcc attgcgccca gtttcccggg tccatggtcg ggcaaatatt cgccgaccag 600
atggaagtga cgatacgcca tcacgctctc gagtcggctt gcttcgtggt gaatgccaca 660
ggctggctga ccgatgcgca aatcacatcg atcacgccgg accccgcgct acaaaaggca 720
ttacgtggcg gttgctgcac cgccatcgtc tcgccggaag gtgtgctcct ggcagagccg 780
ctacgcagcg gcgaaggcat ggtgatcgcc gatctcgata tggcactcat caccaaacgc 840
aaacggatga tggattcggt cggccactat gcgcggcccg aattgttaag cctgcttgtc 900
gacgaccggc gcaaggtacc ggtatccgcg ctatttgccg acagcaaccc tgccaacggg 960
cacacagttt tcaccccatc cgacatacca acccttggga gcgcacatca tgcaaacagt 1020
taccaaaccg aaccagcaac tgatcactga 1050



<210> 126 
<211> 349 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 126 



Met Thr Thr Val Lys Lys Thr Val Arg Ala Ala Ala Ile Gln Ile Ala
1 5 10 15

Pro Asp Leu Asp Ser Ala Gly Gly Thr Leu Asp Lys Val Cys Thr Ala
20 25 30

Ile Gln Lys Ala Ala Ala Gln Gly Ala Glu Leu Val Val Phe Pro Glu
35 40 45

Thr Phe Leu Pro Tyr Tyr Pro Tyr Phe Ser Phe Val Arg Pro Pro Phe
50 55 60

Ala Ser Gly Pro Glu His Leu Leu Leu Tyr Glu Arg Ala Val Ala Val
65 70 75 80

Pro Gly Pro Val Thr Asp Ala Val Ser Ala Val Ala Arg Ser His Gly
85 90 95

Val Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Thr Leu Tyr
100 105 110

Asn Thr Gln Leu Val Phe Asp Ala Asn Gly Glu Leu Val Leu Lys Arg
115 120 125

Arg Lys Ile Thr Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly
130 135 140

Asp Gly Ser Gly Leu Lys Val Val Gln Thr Ala Val Gly Arg Leu Gly
145 150 155 160

Ala Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu
165 170 175

Met Ala Gln His Glu Glu Ile His Cys Ala Gln Phe Pro Gly Ser Met
180 185 190

Val Gly Gln Ile Phe Ala Asp Gln Met Glu Val Thr Ile Arg His His
195 200 205

Ala Leu Glu Ser Ala Cys Phe Val Val Asn Ala Thr Gly Trp Leu Thr
210 215 220

Asp Ala Gln Ile Thr Ser Ile Thr Pro Asp Pro Ala Leu Gln Lys Ala
225 230 235 240

Leu Arg Gly Gly Cys Cys Thr Ala Ile Val Ser Pro Glu Gly Val Leu
245 250 255

Leu Ala Glu Pro Leu Arg Ser Gly Glu Gly Met Val Ile Ala Asp Leu
260 265 270

Asp Met Ala Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly
275 280 285

His Tyr Ala Arg Pro Glu Leu Leu Ser Leu Leu Val Asp Asp Arg Arg
290 295 300

Lys Val Pro Val Ser Ala Leu Phe Ala Asp Ser Asn Pro Ala Asn Gly
305 310 315 320

His Thr Val Phe Thr Pro Ser Asp Ile Pro Thr Leu Gly Ser Ala His
325 330 335

His Ala Asn Ser Tyr Gln Thr Glu Pro Ala Thr Asp His
340 345

<210> 127 
<211> 1005 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 127 

atgatagcac ggaagacaat aagggcggcg gcggtgcaga tagcgcctgt gatggaagat 60

cggaaggcga cgaccgacaa ggtgtgcgcc tacattcagg aagcaggcga gaatggagcc 120

gaaattgtgg tgtttcctga aaccttcatt cccaattatc cctatttctc ttttgtaaaa 180

cctcccgtgt tggcaggtaa ggatcacctt accttgtatg accaagcggt ggaaatccct 240

agccctacta ccgaccaagt ggggtctatg gccaaaaaat ggggaatcgt agtggtgttg 300

ggcgtgaacg aaagaagcca cggcactttg tacaatgccc aaattgtctt tgacgctact 360

ggtgatattg tattggtgag acgcaaaatc acccctacct atcatgaacg gatgatctgg 420

ggacagggag atggcagtgg attaaaagca gtagacacag ctgtgggaag agtgggcgct 480

ttggcgtgtt gggaacacta taatccactt gcgcgctacg cccttatggt agaccatgag 540

gaaattcatt gcagccaatt ccctggctct atggtcggcc ccattttcgg tgaccagata 600

gaagtgacga ttcgccacca tgcgttggaa tcgggttgtt ttgtcatcaa ttccacaggt 660

tggctgtttg aagagcaaat ccaagccatc accgatgatc cgaaactgca caaagcattg 720

aaagacggct gtatgaccgc cattatttct cccgaaggcg tgcatttgac caaaccctta 780

acagaaggcg aaggcatcat ctacgcctat ctggacatga aactcataga caagcggaaa 840

cggatgatgg actcggtagg acactatgca cgtccagagt tgctctcttt gcatatcaac 900

aatgcagagc aaaaaccagc cgtttacacc tctcctctta ccaaaacgga aaccaaagaa 960

gacgtaaaaa gctatgatcg caacaaagaa cagcttatcg tctga 1005



<210> 128 
<211> 334 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample



<400> 128 

Met Ile Ala Arg Lys Thr Ile Arg Ala Ala Ala Val Gln Ile Ala Pro
1 5 10 15

Val Met Glu Asp Arg Lys Ala Thr Thr Asp Lys Val Cys Ala Tyr Ile
20 25 30

Gln Glu Ala Gly Glu Asn Gly Ala Glu Ile Val Val Phe Pro Glu Thr
35 40 45

Phe Ile Pro Asn Tyr Pro Tyr Phe Ser Phe Val Lys Pro Pro Val Leu
50 55 60

Ala Gly Lys Asp His Leu Thr Leu Tyr Asp Gln Ala Val Glu Ile Pro
65 70 75 80

Ser Pro Thr Thr Asp Gln Val Gly Ser Met Ala Lys Lys Trp Gly Ile
85 90 95

Val Val Val Leu Gly Val Asn Glu Arg Ser His Gly Thr Leu Tyr Asn
100 105 110

Ala Gln Ile Val Phe Asp Ala Thr Gly Asp Ile Val Leu Val Arg Arg
115 120 125

Lys Ile Thr Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp
130 135 140

Gly Ser Gly Leu Lys Ala Val Asp Thr Ala Val Gly Arg Val Gly Ala
145 150 155 160

Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met
165 170 175

Val Asp His Glu Glu Ile His Cys Ser Gln Phe Pro Gly Ser Met Val
180 185 190

Gly Pro Ile Phe Gly Asp Gln Ile Glu Val Thr Ile Arg His His Ala
195 200 205

Leu Glu Ser Gly Cys Phe Val Ile Asn Ser Thr Gly Trp Leu Phe Glu
210 215 220

Glu Gln Ile Gln Ala Ile Thr Asp Asp Pro Lys Leu His Lys Ala Leu
225 230 235 240

Lys Asp Gly Cys Met Thr Ala Ile Ile Ser Pro Glu Gly Val His Leu
245 250 255

Thr Lys Pro Leu Thr Glu Gly Glu Gly Ile Ile Tyr Ala Tyr Leu Asp
260 265 270

Met Lys Leu Ile Asp Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285

Tyr Ala Arg Pro Glu Leu Leu Ser Leu His Ile Asn Asn Ala Glu Gln
290 295 300

Lys Pro Ala Val Tyr Thr Ser Pro Leu Thr Lys Thr Glu Thr Lys Glu
305 310 315 320

Asp Val Lys Ser Tyr Asp Arg Asn Lys Glu Gln Leu Ile Val
325 330





<210> 129 
<211> 1011 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 129 

atgtcagaaa agcgaattat taaagcggct gcagttcaaa tcacaccaga ttttgaatcg 60

catgatggaa ccgtaaagaa ggtttgtaat gtaattgatg aagcgggtgc taaaggtgta 120

cagatcattg tattccctga aacctttatt ccatattacc catatttttc tttcatcact 180

ccaccagtga ctgctggcgc ggagcatttg cggctctatg aaaaaagtgt cgtgatacct 240

ggtcccgtta ctcaagccat ttccgaacgt gcacgcatga ataatatggt tgttgtactt 300

ggtgtaaatg agcgtgataa cggcagtcta tataacaccc agattatttt tgatgctacc 360

ggtgagatgc ttctgaagag aagaaaaatc acacctacct atcatgagcg catgatttgg 420

gggcaaggag atgcttcagg cctgaaggtc gtcgatacgg ctattgggcg agtcggagca 480

ttggcatgct gggagcacta taaccctttg gctagataca gcctcatgac acagcatgaa 540

gaaattcact gtgctcaatt tccaggctcc atggttggtc agatcttcgc agatcaaatg 600

gatgtcacga ttcgtcatca tgccttggag tcaggttgct tcgtcatcaa ctccactggc 660

tggttaactg atgatcagat caaatctatc accgacgatc ccaaaatgca gaaagcttta 720

agaggtggtt gcaacacggc cattatttct ccagaaggga atcatttaac cgagcctttg 780

cgagaaggtg aaggcatggt gattgctgat cttgatatgg cactcatcac caaacgaaaa 840

agaatgatgg actcagttgg ccactacgcc agaccagaac tgttgagctt agcgatcaat 900

gatgctccgg ctactccttc attccagatg aacgaacatc gtcttaaatc agtgcaatta 960

cctatcgcag aggagcttaa aaatgacaac aagcttagca gtggacagta a 1011

<210> 130
<211> 336
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample



<400> 130 



Met Ser Glu Lys Arg Ile Ile Lys Ala Ala Ala Val Gln Ile Thr Pro
1 5 10 15

Asp Phe Glu Ser His Asp Gly Thr Val Lys Lys Val Cys Asn Val Ile
20 25 30

Asp Glu Ala Gly Ala Lys Gly Val Gln Ile Ile Val Phe Pro Glu Thr
35 40 45

Phe Ile Pro Tyr Tyr Pro Tyr Phe Ser Phe Ile Thr Pro Pro Val Thr
50 55 60

Ala Gly Ala Glu His Leu Arg Leu Tyr Glu Lys Ser Val Val Ile Pro
65 70 75 80

Gly Pro Val Thr Gln Ala Ile Ser Glu Arg Ala Arg Met Asn Asn Met
85 90 95

Val Val Val Leu Gly Val Asn Glu Arg Asp Asn Gly Ser Leu Tyr Asn
100 105 110

Thr Gln Ile Ile Phe Asp Ala Thr Gly Glu Met Leu Leu Lys Arg Arg
115 120 125

Lys Ile Thr Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp
130 135 140

Ala Ser Gly Leu Lys Val Val Asp Thr Ala Ile Gly Arg Val Gly Ala
145 150 155 160

Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ser Leu Met
165 170 175

Thr Gln His Glu Glu Ile His Cys Ala Gln Phe Pro Gly Ser Met Val
180 185 190

Gly Gln Ile Phe Ala Asp Gln Met Asp Val Thr Ile Arg His His Ala
195 200 205

Leu Glu Ser Gly Cys Phe Val Ile Asn Ser Thr Gly Trp Leu Thr Asp
210 215 220

Asp Gln Ile Lys Ser Ile Thr Asp Asp Pro Lys Met Gln Lys Ala Leu
225 230 235 240

Arg Gly Gly Cys Asn Thr Ala Ile Ile Ser Pro Glu Gly Asn His Leu
245 250 255

Thr Glu Pro Leu Arg Glu Gly Glu Gly Met Val Ile Ala Asp Leu Asp
260 265 270

Met Ala Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285

Tyr Ala Arg Pro Glu Leu Leu Ser Leu Ala Ile Asn Asp Ala Pro Ala
290 295 300

Thr Pro Ser Phe Gln Met Asn Glu His Arg Leu Lys Ser Val Gln Leu
305 310 315 320

Pro Ile Ala Glu Glu Leu Lys Asn Asp Asn Lys Leu Ser Ser Gly Gln
325 330 335



<210> 131 
<211> 1011 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 131 

atgtcagaaa agcgaattat taaagcggct gcagttcaaa tcacaccaga ttttgaatcg 60

catgatggaa ccgtaaagaa ggtttgtaat gtaattgatg aagcgggtgc taaaggtgta 120

cagatcattg tattccctga aacctttatt ccatattacc catatttttc tttcatcact 180

ccaccagtga ctgctggcgc ggagcatttg cggctctatg aaaaaagtgt cgtgatacct 240

ggtcccgtta ctcaagacat ttccgaacgt gcacgcatga ataatatggt tgttgtactt 300

ggtgtaaatg agcgtgataa cggcagtcta tataacaccc agattatttt tgatgctacc 360

ggtgagatgc ttctgaagag aagaaaaatc acacctacct atcatgagcg catgatttgg 420

gggcaaggag atgcttcagg cctgaaggtc gtcgatacgg ctattgggcg agtcggagca 480

ttggcatgct gggagcacta taaccctttg gctagataca gcctcatgac acagcatgaa 540

gaaattcact gtgctcaatt tccaggctcc atggttggtc agatcttcgc agatcaaatg 600

gatgtcacga ttcgtcatca tgccttggag tcaggttgct tcgtcatcaa ctccactggc 660

tggttaactg atgatcagat caaatctatc accgacgatc ccaaaatgca gaaagcttta 720

agaggtggtt gcaacacggc cattatttct ccagaaggga atcatttaac cgagcctttg 780

cgagaaggtg aaggcatggt gattgctgat cttgatatgg cactcatcac caaacgaaaa 840

agaatgatgg actcagttgg ccactacgcc agaccagaac tgttgagctt agcgatcaat 900

gatgctccgg ctactccttc attccagatg aacgaacatc gtcttaaatc agtgcaatta 960

cctatcgcag aggagcttaa aaatgacaac aagcttagca gtggacagta a 1011



<210> 132 
<211> 336 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 132 



Met Ser Glu Lys Arg Ile Ile Lys Ala Ala Ala Val Gln Ile Thr Pro
1 5 10 15

Asp Phe Glu Ser His Asp Gly Thr Val Lys Lys Val Cys Asn Val Ile
20 25 30

Asp Glu Ala Gly Ala Lys Gly Val Gln Ile Ile Val Phe Pro Glu Thr
35 40 45

Phe Ile Pro Tyr Tyr Pro Tyr Phe Ser Phe Ile Thr Pro Pro Val Thr
50 55 60

Ala Gly Ala Glu His Leu Arg Leu Tyr Glu Lys Ser Val Val Ile Pro
65 70 75 80

Gly Pro Val Thr Gln Asp Ile Ser Glu Arg Ala Arg Met Asn Asn Met
85 90 95

Val Val Val Leu Gly Val Asn Glu Arg Asp Asn Gly Ser Leu Tyr Asn
100 105 110

Thr Gln Ile Ile Phe Asp Ala Thr Gly Glu Met Leu Leu Lys Arg Arg
115 120 125

Lys Ile Thr Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp
130 135 140

Ala Ser Gly Leu Lys Val Val Asp Thr Ala Ile Gly Arg Val Gly Ala
145 150 155 160

Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ser Leu Met
165 170 175

Thr Gln His Glu Glu Ile His Cys Ala Gln Phe Pro Gly Ser Met Val
180 185 190

Gly Gln Ile Phe Ala Asp Gln Met Asp Val Thr Ile Arg His His Ala
195 200 205

Leu Glu Ser Gly Cys Phe Val Ile Asn Ser Thr Gly Trp Leu Thr Asp
210 215 220

Asp Gln Ile Lys Ser Ile Thr Asp Asp Pro Lys Met Gln Lys Ala Leu
225 230 235 240

Arg Gly Gly Cys Asn Thr Ala Ile Ile Ser Pro Glu Gly Asn His Leu
245 250 255

Thr Glu Pro Leu Arg Glu Gly Glu Gly Met Val Ile Ala Asp Leu Asp
260 265 270

Met Ala Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285

Tyr Ala Arg Pro Glu Leu Leu Ser Leu Ala Ile Asn Asp Ala Pro Ala
290 295 300

Thr Pro Ser Phe Gln Met Asn Glu His Arg Leu Lys Ser Val Gln Leu
305 310 315 320

Pro Ile Ala Glu Glu Leu Lys Asn Asp Asn Lys Leu Ser Ser Gly Gln
325 330 335



<210> 133 
<211> 1026 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 133 

atgtcgacca agcggatcgt acgcgccgct gccgttcagc tggcaccgga tctggagcgg 60

ccggagggca cactggagaa ggtttgcgcg gccatcgaca aggcggcggg ggacggtgtg 120

cagctcatcg tcttccccga gaccttcgta ccgtactacc cgtacttctc tttcgtgcgt 180

gcgccggtcg cgatgggtgc cgagcacatg cggttatacg agcgcgcggt agcggtgccc 240

ggtccagtaa cggccaccgt ggcggagcgg gcaaaagcgc acgcgatggt cgtcgtgctg 300

ggtgtaaacg agcgcgatca cggctcactg tataacgcgc aactgatctt cgacgagacc 360

ggccgtctcg tcctcaaacg ccgcaagatc actccgacct atcacgagcg catggtgtgg 420

gggcagggcg acggcagcgg ccttaaggtt gtagacaccg gtatcggcag gatcggagcc 480

ctcgcctgct gggagcacta caacccgctc gcgcgctatg cgctcatggc gcagcacgaa 540

gagattcatt gcgcgcagtt tccgggctcg atggtggggc cgatcttcgc ggatcagatc 600

gaggtcacga tccgccatca cgcgctggag tcgggctgct tcgtcgtcaa tgcgaccggc 660

tggctgacac ccgaacagat cgcgtcgatc acaccggacg cgggtctgca aaaggcaatc 720

agcgggggct gcaacaccgc gatcatctcg ccggagggcg tgcacctggc cccgccgttg 780

cgagaaggtg agggcatggt cgtggccgac ctcgacatgg cgctcatcac caaacgcaaa 840

cgcatgatgg attcggtggg tcactacgct cgcccggagt tgctcagcct gcgcatcgat 900

agccgcgccg cttcgccgat gtcgtcacaa atggaaatac ccgggagctt gcatgaaatc 960

accagccacg atgtccagcc agcaactgat gaccgagctc cagtcctccg gcttgaggtt 1020

ggctga 1026



<210> 134 
<211> 341 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 134 



Met Ser Thr Lys Arg Ile Val Arg Ala Ala Ala Val Gln Leu Ala Pro
1 5 10 15

Asp Leu Glu Arg Pro Glu Gly Thr Leu Glu Lys Val Cys Ala Ala Ile
20 25 30

Asp Lys Ala Ala Gly Asp Gly Val Gln Leu Ile Val Phe Pro Glu Thr
35 40 45

Phe Val Pro Tyr Tyr Pro Tyr Phe Ser Phe Val Arg Ala Pro Val Ala
50 55 60

Met Gly Ala Glu His Met Arg Leu Tyr Glu Arg Ala Val Ala Val Pro
65 70 75 80

Gly Pro Val Thr Ala Thr Val Ala Glu Arg Ala Lys Ala His Ala Met
85 90 95

Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Ser Leu Tyr Asn
100 105 110

Ala Gln Leu Ile Phe Asp Glu Thr Gly Arg Leu Val Leu Lys Arg Arg
115 120 125

Lys Ile Thr Pro Thr Tyr His Glu Arg Met Val Trp Gly Gln Gly Asp
130 135 140

Gly Ser Gly Leu Lys Val Val Asp Thr Gly Ile Gly Arg Ile Gly Ala
145 150 155 160

Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met
165 170 175

Ala Gln His Glu Glu Ile His Cys Ala Gln Phe Pro Gly Ser Met Val
180 185 190

Gly Pro Ile Phe Ala Asp Gln Ile Glu Val Thr Ile Arg His His Ala
195 200 205

Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Gly Trp Leu Thr Pro
210 215 220

Glu Gln Ile Ala Ser Ile Thr Pro Asp Ala Gly Leu Gln Lys Ala Ile
225 230 235 240

Ser Gly Gly Cys Asn Thr Ala Ile Ile Ser Pro Glu Gly Val His Leu
245 250 255

Ala Pro Pro Leu Arg Glu Gly Glu Gly Met Val Val Ala Asp Leu Asp
260 265 270

Met Ala Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285

Tyr Ala Arg Pro Glu Leu Leu Ser Leu Arg Ile Asp Ser Arg Ala Ala
290 295 300

Ser Pro Met Ser Ser Gln Met Glu Ile Pro Gly Ser Leu His Glu Ile
305 310 315 320

Thr Ser His Asp Val Gln Pro Ala Thr Asp Asp Arg Ala Pro Val Leu
325 330 335

Arg Leu Glu Val Gly
340





<210> 135 
<211> 1011 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 135 

atgtcagaca agcgaatcat taaagcggct gcagttcaaa tcactcctga ctttgactca 60

gcagatggaa ccgttaagaa agtgtgcaag gtaatcgatg aagcaggtgc aaagggagtt 120

caaattattg tattcccgga aaccttcatc ccttactacc catacttttc attcattaca 180

cctccagtca ctgctggcgc tgagcattta aagctttatg agaaaagtgt cgtgatacct 240

ggcccggtta cccaagcgat tgccgagcga gccagggtta atcagatggt tgtcgtgctt 300

ggtgtcaacg agcgagataa cggtagcctc tacaacacac aattgatctt tgataccaac 360

ggcgaactgc tacttaaaag aagaaaaatc acccctacct accatgaacg tatgatctgg 420

gggcaaggtg atgcatcagg tctcaaagta gttgaaacag agatcgcccg agtaggtgcc 480

ttggcttgtt gggaacacta caacccactg gccagatatg cactcatgac acagcatgaa 540

gaaattcact gtgcgcaatt cccaggctct atggttggcc agatatttgc cgatcagatg 600

gatgtcacta tccgacatca cgccttagag tcaggctgct tcgtcatcaa cgccactggc 660

tggctcaccg acgcgcaaat ccaatcgatt actgatgacc caaaaatgca aaaagcatta 720

cgtggcggct gcaacacagc catcatctcc cccgaagggg tgcacttaac agagccacta 780

cgtgaaggag aaggcatggt gattgccaat cttgatatgg cactcatcac aaaacgaaaa 840

agaatgatgg attcggtagg ccattattca agaccagaat tattaagcct ggcaattaac 900

gacaaaccag caactacaac attttcaatg actgaggggc gtactcaaac agagccattt 960

cgaatcgcag aggagttgaa aaatgacgac aagcttagca ctggaaacta 1011

<210> 136
<211> 336
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample



<400> 136 

Met Ser Asp Lys Arg Ile Ile Lys Ala Ala Ala Val Gln Ile Thr Pro
1 5 10 15

Asp Phe Asp Ser Ala Asp Gly Thr Val Lys Lys Val Cys Lys Val Ile
20 25 30

Asp Glu Ala Gly Ala Lys Gly Val Gln Ile Ile Val Phe Pro Glu Thr
35 40 45

Phe Ile Pro Tyr Tyr Pro Tyr Phe Ser Phe Ile Thr Pro Pro Val Thr
50 55 60

Ala Gly Ala Glu His Leu Lys Leu Tyr Glu Lys Ser Val Val Ile Pro
65 70 75 80

Gly Pro Val Thr Gln Ala Ile Ala Glu Arg Ala Arg Val Asn Gln Met
85 90 95

Val Val Val Leu Gly Val Asn Glu Arg Asp Asn Gly Ser Leu Tyr Asn
100 105 110

Thr Gln Leu Ile Phe Asp Thr Asn Gly Glu Leu Leu Leu Lys Arg Arg
115 120 125

Lys Ile Thr Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp
130 135 140

Ala Ser Gly Leu Lys Val Val Glu Thr Glu Ile Ala Arg Val Gly Ala
145 150 155 160

Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met
165 170 175

Thr Gln His Glu Glu Ile His Cys Ala Gln Phe Pro Gly Ser Met Val
180 185 190

Gly Gln Ile Phe Ala Asp Gln Met Asp Val Thr Ile Arg His His Ala
195 200 205

Leu Glu Ser Gly Cys Phe Val Ile Asn Ala Thr Gly Trp Leu Thr Asp
210 215 220

Ala Gln Ile Gln Ser Ile Thr Asp Asp Pro Lys Met Gln Lys Ala Leu
225 230 235 240

Arg Gly Gly Cys Asn Thr Ala Ile Ile Ser Pro Glu Gly Val His Leu
245 250 255

Thr Glu Pro Leu Arg Glu Gly Glu Gly Met Val Ile Ala Asn Leu Asp
260 265 270

Met Ala Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285

Tyr Ser Arg Pro Glu Leu Leu Ser Leu Ala Ile Asn Asp Lys Pro Ala
290 295 300

Thr Thr Thr Phe Ser Met Thr Glu Gly Arg Thr Gln Thr Glu Pro Phe
305 310 315 320

Arg Ile Ala Glu Glu Leu Lys Asn Asp Asp Lys Leu Ser Thr Gly Asn
325 330 335





<210> 137 
<211> 978 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 137 

atggctattg tcaaggccgc ggcggtgcag atcagtccgg tgctctacag tcgcgccggc 60 

acagtggaca aggtcgtcgc gaagatccgc gagctgggcc gacgaggggt cgagttcgcc 120 

gtcttccccg agaccgtcat tccctactat ccctacttct ctttcgtgca gcccccctac 180 

acccaggcca ccgaacacct gcgcctgctc gaggaatcgg tgaccgtgcc ctccgccgaa 240

accgacgcga tcgccaaggc cgctcgcgag gcgggcatgg tcgtctccat cggcgtcaac 300 

gagcgcgacg gcggaaccat ctacaacacc caactcctct tcgacgccga cggcactctc 360 

atccagcgcc gccgcaagat cacccccacc tatcacgaac gcatggtctg ggggcaggga 420

gacggctcag gtctgcgcgc cgtcgacagt gcggtcggcc gcatcggcca gctcgcctgc 480

tgggagcact accagccact ggcccggtac gccctcatcg ctgacggcga gcagatccac 540 

gccgcgatgt accccggcgc cttcggcggc gatctgttcg ccgagcagat cgaggtcaac 600

atccgccagc acgccctgga atccgccagc ttcgtcgtca acgccaccgc ctggctcgac 660

gccgatcagc aggcccagat cgccaaggac accggaggcc cggtcccggc cttctccggt 720 

ggcttcttca ccgccatcgt cgaccccgaa ggccgtatca tcggcgaccc cctcaccagc 780

ggcgaaggcg aagtgatcgc cgacctcgat ctcgctctca tcaaccgccg caagcgcctc 840 

atggacgcca gtggacacta ccagccgccc gaaattctta gcttcacatt gaccggtgca 900 

ccggcgcctt atgtcaagag cgcggcgtgc cggggaaccc cgggtacgac cgtggccgag 960

gagggacggt ccgcttag 978 

<210> 138 
<211> 325 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 138 

Met Ala Ile Val Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Ala Gly Thr Val Asp Lys Val Val Ala Lys Ile Arg Glu Leu
20 25 30

Gly Arg Arg Gly Val Glu Phe Ala Val Phe Pro Glu Thr Val Ile Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Pro Pro Tyr Thr Gln Ala Thr
50 55 60

Glu His Leu Arg Leu Leu Glu Glu Ser Val Thr Val Pro Ser Ala Glu
65 70 75 80

Thr Asp Ala Ile Ala Lys Ala Ala Arg Glu Ala Gly Met Val Val Ser
85 90 95

Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Thr Gln Leu
100 105 110

Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys Ile Thr
115 120 125

Pro Thr Tyr His Glu Arg Met Val Trp Gly Gln Gly Asp Gly Ser Gly
130 135 140

Leu Arg Ala Val Asp Ser Ala Val Gly Arg Ile Gly Gln Leu Ala Cys
145 150 155 160

Trp Glu His Tyr Gln Pro Leu Ala Arg Tyr Ala Leu Ile Ala Asp Gly
165 170 175

Glu Gln Ile His Ala Ala Met Tyr Pro Gly Ala Phe Gly Gly Asp Leu
180 185 190

Phe Ala Glu Gln Ile Glu Val Asn Ile Arg Gln His Ala Leu Glu Ser
195 200 205

Ala Ser Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp Gln Gln
210 215 220

Ala Gln Ile Ala Lys Asp Thr Gly Gly Pro Val Pro Ala Phe Ser Gly
225 230 235 240

Gly Phe Phe Thr Ala Ile Val Asp Pro Glu Gly Arg Ile Ile Gly Asp
245 250 255

Pro Leu Thr Ser Gly Glu Gly Glu Val Ile Ala Asp Leu Asp Leu Ala
260 265 270

Leu Ile Asn Arg Arg Lys Arg Leu Met Asp Ala Ser Gly His Tyr Gln
275 280 285

Pro Pro Glu Ile Leu Ser Phe Thr Leu Thr Gly Ala Pro Ala Pro Tyr
290 295 300

Val Lys Ser Ala Ala Cys Arg Gly Thr Pro Gly Thr Thr Val Ala Glu
305 310 315 320

Glu Gly Arg Ser Ala
325

<210> 139
<211> 999
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 139 

atgaaaacaa cggttaccgt tgcctgcgtt caggccgccc ccgtatttat ggatttagaa 60 

ggcaccgtag ataaaacaat caccctcatc tctgaagccg cacagaaagg cgcggagctc 120 

atcgcttttc cggagacctg gatacccggt tacccgtggt tcttatggct gaactcgccc 180 

gccacaaata tgcccctggt ttatcagtat catcagaact ctctggtgct ggacagtacc 240

caggcgaagc gaattgcgga tgcggcacgg cagaataaca tcactgtcgc tctgggcttc 300 

agcgaacgcg atcatggaag cctctatatc gcacagtggc tgattggcag cgacggggag 360 

accattggca tccggcgcaa gctcaaggcc acgcacgtgg agcgtacgct gttcggcgaa 420 

agcgacggct cctccctgac cacctgggag acacctctgg gtaacgtcgg ggccctctgc 480

tgctgggagc acctgcagcc gctgtcccgc tatgcaatgt attcccagca tgaggagatc 540

cacatcgctg cctggcccag tttcagtctc tacaccagtg caacggccgc actgggtcct 600 

gacgtcaata cggcggcttc acgcctctat gccgcggagg ggcagtgctt cgtgatagcc 660 

ccgtgtgccg tggtttctga tgaaatgatt gatttactct gtcctgatga tgaccggaga 720

gcgttactca gtgccggagg gggacatgcc cgtatttacg gcccggacgg aagagaactc 780

gtcacccctc tcggggaaaa tgaggaagga ctgcttatcg ctgagctcga ctctgctgcg 840

attacctttg ccaaactggc ggcagacccg gttggccact attcccgtcc tgacgtgacc 900 

cgcctccttt ttaatccttc agccaacaag actgtgatta aacgacattc gcctcctgag 960 

ttaattgccg agcagactgc agaagaagag gaggagtag 999 



<210> 140 
<211> 332 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 140 



Met Lys Thr Thr Val Thr Val Ala Cys Val Gln Ala Ala Pro Val Phe
1 5 10 15

Met Asp Leu Glu Gly Thr Val Asp Lys Thr Ile Thr Leu Ile Ser Glu
20 25 30

Ala Ala Gln Lys Gly Ala Glu Leu Ile Ala Phe Pro Glu Thr Trp Ile
35 40 45

Pro Gly Tyr Pro Trp Phe Leu Trp Leu Asn Ser Pro Ala Thr Asn Met
50 55 60

Pro Leu Val Tyr Gln Tyr His Gln Asn Ser Leu Val Leu Asp Ser Thr
65 70 75 80

Gln Ala Lys Arg Ile Ala Asp Ala Ala Arg Gln Asn Asn Ile Thr Val
85 90 95

Ala Leu Gly Phe Ser Glu Arg Asp His Gly Ser Leu Tyr Ile Ala Gln
100 105 110

Trp Leu Ile Gly Ser Asp Gly Glu Thr Ile Gly Ile Arg Arg Lys Leu
115 120 125

Lys Ala Thr His Val Glu Arg Thr Leu Phe Gly Glu Ser Asp Gly Ser
130 135 140

Ser Leu Thr Thr Trp Glu Thr Pro Leu Gly Asn Val Gly Ala Leu Cys
145 150 155 160

Cys Trp Glu His Leu Gln Pro Leu Ser Arg Tyr Ala Met Tyr Ser Gln
165 170 175

His Glu Glu Ile His Ile Ala Ala Trp Pro Ser Phe Ser Leu Tyr Thr
180 185 190

Ser Ala Thr Ala Ala Leu Gly Pro Asp Val Asn Thr Ala Ala Ser Arg
195 200 205

Leu Tyr Ala Ala Glu Gly Gln Cys Phe Val Ile Ala Pro Cys Ala Val
210 215 220

Val Ser Asp Glu Met Ile Asp Leu Leu Cys Pro Asp Asp Asp Arg Arg
225 230 235 240

Ala Leu Leu Ser Ala Gly Gly Gly His Ala Arg Ile Tyr Gly Pro Asp
245 250 255

Gly Arg Glu Leu Val Thr Pro Leu Gly Glu Asn Glu Glu Gly Leu Leu
260 265 270

Ile Ala Glu Leu Asp Ser Ala Ala Ile Thr Phe Ala Lys Leu Ala Ala
275 280 285

Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val Thr Arg Leu Leu Phe
290 295 300

Asn Pro Ser Ala Asn Lys Thr Val Ile Lys Arg His Ser Pro Pro Glu
305 310 315 320

Leu Ile Ala Glu Gln Thr Ala Glu Glu Glu Glu Glu
325 330















<210> 141 
<211> 1026 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 141 

atggtgttca aggcagcgac tgttcatgca gctccggtat tcatggacaa ggaagcgtcg 60

atagataagg ctatcgacct catcaagaag gccggtcagg aagggattaa gcttctggtt 120

tttccggaaa cgtttattcc gggctatccg tattttatcg aatgctatcc gccgcttgcg 180

caggtggaag cgctcgccca gtacactgac gcttccgtgg agatcgacgg cccggaagtc 240

acccggcttc agcaggtagc caaggcggca ggcgttgcag tcgtcatggg catcagcgaa 300

cgaatggctg agacccgaac ctgcttcaac tcgcaggtgt tcattgacgt cgacggcacg 360

ctgctcggcg tgcatcgcaa gctgcagccg acttatgccg agcgcaaggt atgggcacag 420

ggcggtggtt atacgctgag gacctacaag agctcgcttg gcgtgctcgg cggtcttgcc 480

tgctgggagc acacgatgaa cctcgcgcgg caggccctga tcatgcagag cgagcagatc 540

catgcggctg catggcccgg actatcgacg atgcgaggtt tcgagcccgt ggccgatatc 600

cagatcgacg ccatgatgaa gactcacgcg cttaccgcac agtgctgggt gctttcggcc 660

ggcaatcccg tcgaccggac ctgcctcgac tggatggaaa agaacatcgg accgcaggat 720

tacgtcaccg agggcggcgg atggagcgcc gttatccatc cgttcaacag ctatctcggc 780

ggccctcaca cgggccttga ggaaaagctg gtcgtcggcg agatcaatct ggacgatctc 840

aagttcgtca aagtctggct cgacagcaaa gggcactatg ctcggccgga aatcctgaaa 900

cttggcgtca accaaaagca gatttggcct gatgaacatt tgctggcgcg gcaggatgtg 960

accgagttgc tggaggcgga tatcatcgaa taccccttgc aactgttgca agaccgcgcg 1020

caatag 1026



<210> 142 
<211> 341 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 

<400> 142

Met Val Phe Lys Ala Ala Thr Val His Ala Ala Pro Val Phe Met Asp
1 5 10 15

Lys Glu Ala Ser Ile Asp Lys Ala Ile Asp Leu Ile Lys Lys Ala Gly
20 25 30

Gln Glu Gly Ile Lys Leu Leu Val Phe Pro Glu Thr Phe Ile Pro Gly
35 40 45

Tyr Pro Tyr Phe Ile Glu Cys Tyr Pro Pro Leu Ala Gln Val Glu Ala
50 55 60

Leu Ala Gln Tyr Thr Asp Ala Ser Val Glu Ile Asp Gly Pro Glu Val
65 70 75 80

Thr Arg Leu Gln Gln Val Ala Lys Ala Ala Gly Val Ala Val Val Met
85 90 95

Gly Ile Ser Glu Arg Met Ala Glu Thr Arg Thr Cys Phe Asn Ser Gln
100 105 110

Val Phe Ile Asp Val Asp Gly Thr Leu Leu Gly Val His Arg Lys Leu
115 120 125

Gln Pro Thr Tyr Ala Glu Arg Lys Val Trp Ala Gln Gly Gly Gly Tyr
130 135 140

Thr Leu Arg Thr Tyr Lys Ser Ser Leu Gly Val Leu Gly Gly Leu Ala
145 150 155 160

Cys Trp Glu His Thr Met Asn Leu Ala Arg Gln Ala Leu Ile Met Gln
165 170 175

Ser Glu Gln Ile His Ala Ala Ala Trp Pro Gly Leu Ser Thr Met Arg
180 185 190

Gly Phe Glu Pro Val Ala Asp Ile Gln Ile Asp Ala Met Met Lys Thr
195 200 205

His Ala Leu Thr Ala Gln Cys Trp Val Leu Ser Ala Gly Asn Pro Val
210 215 220

Asp Arg Thr Cys Leu Asp Trp Met Glu Lys Asn Ile Gly Pro Gln Asp
225 230 235 240

Tyr Val Thr Glu Gly Gly Gly Trp Ser Ala Val Ile His Pro Phe Asn
245 250 255

Ser Tyr Leu Gly Gly Pro His Thr Gly Leu Glu Glu Lys Leu Val Val
260 265 270

Gly Glu Ile Asn Leu Asp Asp Leu Lys Phe Val Lys Val Trp Leu Asp
275 280 285

Ser Lys Gly His Tyr Ala Arg Pro Glu Ile Leu Lys Leu Gly Val Asn
290 295 300

Gln Lys Gln Ile Trp Pro Asp Glu His Leu Leu Ala Arg Gln Asp Val
305 310 315 320

Thr Glu Leu Leu Glu Ala Asp Ile Ile Glu Tyr Pro Leu Gln Leu Leu
325 330 335

Gln Asp Arg Ala Gln
340



<210> 143 
<211> 1122 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 143 

atgacgatca ttgcaggcgc ggttcatgcg gcgccggtat tcatggatgt cgatgccact 60

atcgacaagg catgcgaaat cattcgcaag gcaggcaaag acggaatcga gcttctcgtc 120

ttccctgagg ttttcgtacc cggctacccc tacttcatcg agtgctatcc gaccttgaac 180

caaaccgctg cgctggccgc ctatacggat gcctcgatcg aggttccagg cccggaagtc 240

cggcgcttgc aggtggccgc acatcaggcc ggcgtgatgg ttgtgatggg cgtgagcgag 300

cgtctgcgcg gatctcgcac ctgcttcaac agccaggtgt tcatcgaccg tgacggcacc 360

ttgctgggcg tgcaccgcaa actccagccg acctatgtcg agcgcatcgt ctggggccag 420

ggcggcggac acaccctcaa ggtattcgac agcacactgg gcaaggtggg cggactggcc 480

tgctgggagc acacgatgaa cctcgcgcgc catgcgttga tcgcccaggg tatccagatc 540

catgccgccg cctggcctgg gctttcgaca atggccgggt tcgaagcggt ggctgacgtc 600

cagatcgacg cgatgatgaa aactcatgcg ttgagcgcgc aatgctttgt cgtatcggcc 660

gcaaaccctg tggatcagac ctgcctggag tggatggaga aacacctcgg cccgcagcaa 720

ctcgttaccg ccggcggagg ctggtcggca atcgtccatc ctttctgtgg ttatatcgcc 780

gcccctcaca ccggtgccga ggagaaggtt ctggtaggcg aaatcaatct ggacgacctc 840

aagcaggtca aggtatgggt tgattccgca ggtcattatg cgcgcccgga agtcgtgcaa 900

ttgcgcgacg ccctggagag ccgtggcaat tatcgcgttg cgctgacccg cgacgccgac 960

accttcgtgc cgctggaaga ccgcgtgcgc tttgcgcgcc agcagaacgc cgacctcttc 1020

atctcgatcc acgccgacgc caacgccaac cacgatgcgc gcggggctgg cttcacttcg 1080

aaggttgaaa acctttccac gggcatttta ccaggcgatt ga 1122



<210> 144 
<211> 373 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 144 
Met Thr Ile Ile Ala Gly Ala Val His Ala Ala Pro Val Phe Met Asp
1 5 10 15

Val Asp Ala Thr Ile Asp Lys Ala Cys Glu Ile Ile Arg Lys Ala Gly
20 25 30

Lys Asp Gly Ile Glu Leu Leu Val Phe Pro Glu Val Phe Val Pro Gly
35 40 45

Tyr Pro Tyr Phe Ile Glu Cys Tyr Pro Thr Leu Asn Gln Thr Ala Ala
50 55 60

Leu Ala Ala Tyr Thr Asp Ala Ser Ile Glu Val Pro Gly Pro Glu Val
65 70 75 80

Arg Arg Leu Gln Val Ala Ala His Gln Ala Gly Val Met Val Val Met
85 90 95

Gly Val Ser Glu Arg Leu Arg Gly Ser Arg Thr Cys Phe Asn Ser Gln
100 105 110

Val Phe Ile Asp Arg Asp Gly Thr Leu Leu Gly Val His Arg Lys Leu
115 120 125

Gln Pro Thr Tyr Val Glu Arg Ile Val Trp Gly Gln Gly Gly Gly His
130 135 140

Thr Leu Lys Val Phe Asp Ser Thr Leu Gly Lys Val Gly Gly Leu Ala
145 150 155 160

Cys Trp Glu His Thr Met Asn Leu Ala Arg His Ala Leu Ile Ala Gln
165 170 175

Gly Ile Gln Ile His Ala Ala Ala Trp Pro Gly Leu Ser Thr Met Ala
180 185 190

Gly Phe Glu Ala Val Ala Asp Val Gln Ile Asp Ala Met Met Lys Thr
195 200 205

His Ala Leu Ser Ala Gln Cys Phe Val Val Ser Ala Ala Asn Pro Val
210 215 220

Asp Gln Thr Cys Leu Glu Trp Met Glu Lys His Leu Gly Pro Gln Gln
225 230 235 240

Leu Val Thr Ala Gly Gly Gly Trp Ser Ala Ile Val His Pro Phe Cys
245 250 255

Gly Tyr Ile Ala Ala Pro His Thr Gly Ala Glu Glu Lys Val Leu Val
260 265 270

Gly Glu Ile Asn Leu Asp Asp Leu Lys Gln Val Lys Val Trp Val Asp
275 280 285

Ser Ala Gly His Tyr Ala Arg Pro Glu Val Val Gln Leu Arg Asp Ala
290 295 300

Leu Glu Ser Arg Gly Asn Tyr Arg Val Ala Leu Thr Arg Asp Ala Asp
305 310 315 320

Thr Phe Val Pro Leu Glu Asp Arg Val Arg Phe Ala Arg Gln Gln Asn
325 330 335

Ala Asp Leu Phe Ile Ser Ile His Ala Asp Ala Asn Ala Asn His Asp
340 345 350

Ala Arg Gly Ala Gly Phe Thr Ser Lys Val Glu Asn Leu Ser Thr Gly
355 360 365

Ile Leu Pro Gly Asp
370



<210> 145 
<211> 1014 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 145 

atgggcatca cccacccgaa ctacaaggtc gcagtggtcc aggctgcgcc ggtctggttg 60

aacctcgagg caacggtcga gaagacaatc aggtatattg aagaggcggc caaggctgga 120

gcgaagctga tagcgtttcc ggaaacctgg attccgggct atccatggca catttggatc 180

ggaacgcccg catgggcaat cggtaagggc ttcgtccagc gctatttcga caactcgctc 240

agctatgaca gcccgctcgc gcggcagatc gctgacgccg cagcaaagag caagatcacg 300

gttgttctcg gcctctccga gcgcgacggt ggaagcctat acatcgcgca atggctgatc 360

ggaccagatg gcgagaccat cgcgaagcgg cgcaagctgc gtccgaccca cgtcgagcgc 420

acggtgttcg gtgacggtga cggcagccac atcgccgtgc atgaccgatc cgatctgggc 480

cggctcgggg cgttgtgctg ctgggagcac gtgcagccgt tgacgaaatt cgcgatgtac 540

gcgcagaacg agcaggttca cgtggcagca tggccgagct tctcgatgta cgaacccttt 600

gcgcatgcgc tgggttggga gacgaacaac gcggtcagca aggtctacgc ggtcgaggga 660

tcgtgcttcg tgctcgctcc ctgtgccgtt atttcgcaag cgatggtgga cgagatgtgc 720

gacactcccg acaagcgcga gcttgttcac gccggcggcg gccacgcggt gatttacggc 780

cctgacggaa gcccgctcgc agaaaagctc ggggaaaacg aagaggggct tctctacgcg 840

acggtcaatc ttgctgcgat cggggttgcc aagaatgccg cggatccggc cgggcactat 900

tcgcgtccgg acgttctaag gctgctattc aacaagagcc cggcccgaag agtggagcat 960

tttgcgctgc cgcacgagca gctcgagatc ggggcaggcc cgtctggcga ctga 1014



<210> 146 
<211> 337 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample



<400> 146 



Met Gly Ile Thr His Pro Asn Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15

Pro Val Trp Leu Asn Leu Glu Ala Thr Val Glu Lys Thr Ile Arg Tyr
20 25 30

Ile Glu Glu Ala Ala Lys Ala Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45

Thr Trp Ile Pro Gly Tyr Pro Trp His Ile Trp Ile Gly Thr Pro Ala
50 55 60

Trp Ala Ile Gly Lys Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80

Ser Tyr Asp Ser Pro Leu Ala Arg Gln Ile Ala Asp Ala Ala Ala Lys
85 90 95

Ser Lys Ile Thr Val Val Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110

Leu Tyr Ile Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125

Lys Arg Arg Lys Leu Arg Pro Thr His Val Glu Arg Thr Val Phe Gly
130 135 140

Asp Gly Asp Gly Ser His Ile Ala Val His Asp Arg Ser Asp Leu Gly
145 150 155 160

Arg Leu Gly Ala Leu Cys Cys Trp Glu His Val Gln Pro Leu Thr Lys
165 170 175

Phe Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
180 185 190

Ser Phe Ser Met Tyr Glu Pro Phe Ala His Ala Leu Gly Trp Glu Thr
195 200 205

Asn Asn Ala Val Ser Lys Val Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220

Leu Ala Pro Cys Ala Val Ile Ser Gln Ala Met Val Asp Glu Met Cys
225 230 235 240

Asp Thr Pro Asp Lys Arg Glu Leu Val His Ala Gly Gly Gly His Ala
245 250 255

Val Ile Tyr Gly Pro Asp Gly Ser Pro Leu Ala Glu Lys Leu Gly Glu
260 265 270

Asn Glu Glu Gly Leu Leu Tyr Ala Thr Val Asn Leu Ala Ala Ile Gly
275 280 285

Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300

Val Leu Arg Leu Leu Phe Asn Lys Ser Pro Ala Arg Arg Val Glu His
305 310 315 320

Phe Ala Leu Pro His Glu Gln Leu Glu Ile Gly Ala Gly Pro Ser Gly
325 330 335

Asp



<210> 147 
<211> 1098 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 147 

atgacccagc acgagaccac tgcccggagg ctggcagctg tgcatgccgc gcctgtgttc 60

atggacaccg acgcgaccat cgacaaggtg atcggcttcg tcgaacaggc cggccgcgaa 120

ggcatcgaac tcctggtgtt ccccgagacc ttcgtgcctg gttaccccta ctggatcgag 180

tgctatccgc cgctgcagca ggtggccgcc aacgcgcagt acacggacgc ctccgtcgag 240

gtgcctggtc cggagatcaa gcgggtgcag gcggcctgtg cccgcgctgg cgtcgaagtc 300

gtcctcggcg tcagcgagcg actcaggggt accaggacat gcttcaactc ccaggtgttc 360

atcgacgccg acgggagcct gctcggcgtg caccgcaagc tgcagccgac gtacgtggag 420

cgcatcgtgt gggcccaggg cggaggcgcg accctgtcgg tgttcggctc ccgctccggc 480

cggatcggcg gtctggcctg ctgggagcac acgatgaacc tggctcgtca ggcactgctt 540

gagcaggagc agcagatcca cgcggcggcg tggcctgccc tgtcgacgat ggcggggttc 600

gagaccgtcg cggacgccca gatcgaggcc atgatgaaga cccatgcgct cacggcacag 660

gtgttcgtca tctgcgcgtc caacccggtc gacggcactt gcctggaatg gatgcgggac 720

aacctcggtg aacagaagtt cgtgaccgcc ggagggggct ggtccgcggt catccacccc 780

ttcaactcct tcctcggcgg gccgcatacc ggtttggagg agaagctcgt cagcgcgacg 840

atcgacttct ccgacatccg cttggtcaag gcctgggttg attcgaaggg gcactacgcg 900

cggcccgagg tcctgcgact cgcggtcgac cgcaagccac tgtggcacga cgagtgcgag 960

gtgccgggac aggcgcaggt acgcacccgc gctgcttctc tggcagtgca ggagcacccg 1020

gtggtgctgc ctcagggggc ggcgcggccc gctccgcaag actgggacac ctctgcggcg 1080

caggagctga cttcctga 1098



<210> 148 
<211> 365 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 148 
Met Thr Gln His Glu Thr Thr Ala Arg Arg Leu Ala Ala Val His Ala
1 5 10 15

Ala Pro Val Phe Met Asp Thr Asp Ala Thr Ile Asp Lys Val Ile Gly
20 25 30

Phe Val Glu Gln Ala Gly Arg Glu Gly Ile Glu Leu Leu Val Phe Pro
35 40 45

Glu Thr Phe Val Pro Gly Tyr Pro Tyr Trp Ile Glu Cys Tyr Pro Pro
50 55 60

Leu Gln Gln Val Ala Ala Asn Ala Gln Tyr Thr Asp Ala Ser Val Glu
65 70 75 80

Val Pro Gly Pro Glu Ile Lys Arg Val Gln Ala Ala Cys Ala Arg Ala
85 90 95

Gly Val Glu Val Val Leu Gly Val Ser Glu Arg Leu Arg Gly Thr Arg
100 105 110

Thr Cys Phe Asn Ser Gln Val Phe Ile Asp Ala Asp Gly Ser Leu Leu
115 120 125

Gly Val His Arg Lys Leu Gln Pro Thr Tyr Val Glu Arg Ile Val Trp
130 135 140

Ala Gln Gly Gly Gly Ala Thr Leu Ser Val Phe Gly Ser Arg Ser Gly
145 150 155 160

Arg Ile Gly Gly Leu Ala Cys Trp Glu His Thr Met Asn Leu Ala Arg
165 170 175

Gln Ala Leu Leu Glu Gln Glu Gln Gln Ile His Ala Ala Ala Trp Pro
180 185 190

Ala Leu Ser Thr Met Ala Gly Phe Glu Thr Val Ala Asp Ala Gln Ile
195 200 205

Glu Ala Met Met Lys Thr His Ala Leu Thr Ala Gln Val Phe Val Ile
210 215 220

Cys Ala Ser Asn Pro Val Asp Gly Thr Cys Leu Glu Trp Met Arg Asp
225 230 235 240

Asn Leu Gly Glu Gln Lys Phe Val Thr Ala Gly Gly Gly Trp Ser Ala
245 250 255

Val Ile His Pro Phe Asn Ser Phe Leu Gly Gly Pro His Thr Gly Leu
260 265 270

Glu Glu Lys Leu Val Ser Ala Thr Ile Asp Phe Ser Asp Ile Arg Leu
275 280 285

Val Lys Ala Trp Val Asp Ser Lys Gly His Tyr Ala Arg Pro Glu Val
290 295 300

Leu Arg Leu Ala Val Asp Arg Lys Pro Leu Trp His Asp Glu Cys Glu
305 310 315 320

Val Pro Gly Gln Ala Gln Val Arg Thr Arg Ala Ala Ser Leu Ala Val
325 330 335

Gln Glu His Pro Val Val Leu Pro Gln Gly Ala Ala Arg Pro Ala Pro
340 345 350

Gln Asp Trp Asp Thr Ser Ala Ala Gln Glu Leu Thr Ser
355 360 365

<210> 149 
<211> 942 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample

<400> 149

atgacgaagc ttgagaaggt ggtcgcggcg gcggtccagg cgacgccgga gttcctcgac 60

cgcgaggcga ccgtcgagaa ggccgtgcgg ctgatcaagg aagcggccgg ggagggcgcc 120

ggcctgatcg tgttccccga gacgttcatc ccgacgtacc cggactgggt ctggcgcgcg 180

ccggcctggg acggcccatc cgcggacctg tacgcaatgc tgctggagaa cgcggtggag 240

atccccgggc cggtgacgga gaccctgggg aaggcggcga agcaggccaa ggccttcgtg 300

tcgatgggcg tcaacgagcg cgagccgggc ggcgggacga tctacaacac gcaggtcacg 360

ttcggacccg acgggagcgt gctcggcaag caccgcaagc tgatgccgac cggcggcgag 420

cgcctggtgt gggggatggg cgacgggtcg atgctccagg tctatgacac gccgttcggc 480

cgcctgggcg ggctgatctg ctgggagaac tacatgccgc tcgcgcgcta ctcgatgtac 540

gccaagggcg tggacgtcta cgttgcgccg acgtgggaca acagcgacat gtgggtggcg 600

acgctccgcc acatcgccaa ggaggggcgg ctgtacgtga tcggcgtggc gccgctgctg 660

cgcgggtcgg acgtccccga cgacgtgccg gggaaggccg agctgtgggg cggcgatgac 720

gactggatgt cgcgcggctt ctccaccatc gtcgcgccgg gcggcgaggt gctggccggt 780

ccgctgacgg aggaggaagg catcctctac gcggagatcg acccggcgag agcccgttcg 840

tcacggcacc agttcgatcc ggtggggcac tactcgcgcc ccgacgtgtt tcggctcgtc 900

gtggacgagt cgcccaagcc ccagacgtcc ggcccgggct ag 942



<210> 150 
<211> 313 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 150 



Met Thr Lys Leu Glu Lys Val Val Ala Ala Ala Val Gln Ala Thr Pro
1 5 10 15

Glu Phe Leu Asp Arg Glu Ala Thr Val Glu Lys Ala Val Arg Leu Ile
20 25 30

Lys Glu Ala Ala Gly Glu Gly Ala Gly Leu Ile Val Phe Pro Glu Thr
35 40 45

Phe Ile Pro Thr Tyr Pro Asp Trp Val Trp Arg Ala Pro Ala Trp Asp
50 55 60

Gly Pro Ser Ala Asp Leu Tyr Ala Met Leu Leu Glu Asn Ala Val Glu
65 70 75 80

Ile Pro Gly Pro Val Thr Glu Thr Leu Gly Lys Ala Ala Lys Gln Ala
85 90 95

Lys Ala Phe Val Ser Met Gly Val Asn Glu Arg Glu Pro Gly Gly Gly
100 105 110

Thr Ile Tyr Asn Thr Gln Val Thr Phe Gly Pro Asp Gly Ser Val Leu
115 120 125

Gly Lys His Arg Lys Leu Met Pro Thr Gly Gly Glu Arg Leu Val Trp
130 135 140

Gly Met Gly Asp Gly Ser Met Leu Gln Val Tyr Asp Thr Pro Phe Gly
145 150 155 160

Arg Leu Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met Pro Leu Ala Arg
165 170 175

Tyr Ser Met Tyr Ala Lys Gly Val Asp Val Tyr Val Ala Pro Thr Trp
180 185 190

Asp Asn Ser Asp Met Trp Val Ala Thr Leu Arg His Ile Ala Lys Glu
195 200 205

Gly Arg Leu Tyr Val Ile Gly Val Ala Pro Leu Leu Arg Gly Ser Asp
210 215 220

Val Pro Asp Asp Val Pro Gly Lys Ala Glu Leu Trp Gly Gly Asp Asp
225 230 235 240

Asp Trp Met Ser Arg Gly Phe Ser Thr Ile Val Ala Pro Gly Gly Glu
245 250 255

Val Leu Ala Gly Pro Leu Thr Glu Glu Glu Gly Ile Leu Tyr Ala Glu
260 265 270

Ile Asp Pro Ala Arg Ala Arg Ser Ser Arg His Gln Phe Asp Pro Val
275 280 285

Gly His Tyr Ser Arg Pro Asp Val Phe Arg Leu Val Val Asp Glu Ser
290 295 300

Pro Lys Pro Gln Thr Ser Gly Pro Gly
305 310
<210> 151
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 151 

atgagagtcg ttaaagccgc cgcggtccaa ctgaaaccag tcctttatag ccgtgaggga 60

acagtcgata acgtcgtcaa gaagatccac gagctgggcc aacaaggagt gcagttcgca 120

acgttcccgg aaaccgtggt gccttactat ccgtactttt cgatcgtgca gtccggctat 180

caaatccttg ccggcggtga gttcctaaag ctgcttgatc agtcagtgac cgtgccatct 240

cttgccaccg aagcgatcgg cgaggcctgc aggcaagcgg gcgtcgttgt ctccatcggc 300

gtcaacgagc gtgacggggg aactctgtac aatacgcaac ttctctttga tgccgacggc 360

acgttgattc aaagacgacg caagatcacg cccacccatt acgagcgcat ggtctggggc 420

cagggcgatg gctcaggttt acgggcggtt gacagcaagg tcgcgcgcat tggtcaactg 480

gcttgttttg agcactacaa cccgcttgcg cgttacgcca tgatggccga tggcgagcaa 540

atccactctg cgatgttccc gggctccatg ttcggcgatg cgttttcaga gaaggtggaa 600

atcaacgtaa ggcagcatgc aatggagtct ggatgctttg tcgtctgcgc tacggcctgg 660

ctggatgccg accaacaggc acaaatcatg aaggacacag gctgcgagat cggtccgatc 720

tcgggcggtt gcttcaccgc tatcgtgaca cccgacggga cgctgatagg cgaacccatc 780

cactcgggcg aaggcgtttg tattgccgac ctcgatttca agctcatcga caagcggaag 840

cacgtggtgg acacgcgcgg ccactacagc cggccagaat tgctcagcct cctaattgat 900

cggactccca cggcacacat acacgaacgg aacgagcaac cgaagtcggc cgttgagcaa 960

gactcgcaga atgtattcac cgctattgct taa 993



<210> 152 
<211> 330 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample

<400> 152

Met Arg Val Val Lys Ala Ala Ala Val Gln Leu Lys Pro Val Leu Tyr
1 5 10 15

Ser Arg Glu Gly Thr Val Asp Asn Val Val Lys Lys Ile His Glu Leu
20 25 30

Gly Gln Gln Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Ile Val Gln Ser Gly Tyr Gln Ile Leu Ala
50 55 60

Gly Gly Glu Phe Leu Lys Leu Leu Asp Gln Ser Val Thr Val Pro Ser
65 70 75 80

Leu Ala Thr Glu Ala Ile Gly Glu Ala Cys Arg Gln Ala Gly Val Val
85 90 95

Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr
100 105 110

Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125

Ile Thr Pro Thr His Tyr Glu Arg Met Val Trp Gly Gln Gly Asp Gly
130 135 140

Ser Gly Leu Arg Ala Val Asp Ser Lys Val Ala Arg Ile Gly Gln Leu
145 150 155 160

Ala Cys Phe Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Met Met Ala
165 170 175

Asp Gly Glu Gln Ile His Ser Ala Met Phe Pro Gly Ser Met Phe Gly
180 185 190

Asp Ala Phe Ser Glu Lys Val Glu Ile Asn Val Arg Gln His Ala Met
195 200 205

Glu Ser Gly Cys Phe Val Val Cys Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220

Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Glu Ile Gly Pro Ile
225 230 235 240

Ser Gly Gly Cys Phe Thr Ala Ile Val Thr Pro Asp Gly Thr Leu Ile
245 250 255

Gly Glu Pro Ile His Ser Gly Glu Gly Val Cys Ile Ala Asp Leu Asp
260 265 270

Phe Lys Leu Ile Asp Lys Arg Lys His Val Val Asp Thr Arg Gly His
275 280 285

Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Thr
290 295 300

Ala His Ile His Glu Arg Asn Glu Gln Pro Lys Ser Ala Val Glu Gln
305 310 315 320

Asp Ser Gln Asn Val Phe Thr Ala Ile Ala
325 330



<210> 153 
<211> 1074 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 153 

atgccaaacg caagaaagat tgttggagcc gtggcccaag ttgcacagga attcttcgac 60

actgaagcga atctcggtaa agcgatagcg gcgattcaca atgctgcgaa gcaaggcgca 120

gatatcgtcg tcttcgccga atgctatttg ggccaatatc catattgggc gcaattttac 180

gacaactctg ccaagaacta ttccaaggtt tggacggccc tgtacgacgg tgcgatcact 240

gtgggtggcg atgaatgccg ggctattgct gctgcggcta gacagtccaa gattcatgtc 300

gtcatgggtt gcaatgagct atccgaccga gccggcggcg caacgttata caacagcctc 360

ttgtttttcg accgaaaggg cgagttgatc ggtcgacacc ggaaattgat gccgtcgatg 420

cacgagcggt tgatccatgg cacaggcgac ggaagagact tgaatgttta cgataccgat 480

atcggtatgt tgggtgggtt gatttgctgg gagcaccata tgtcgctctc gaagtatgcc 540

atggcgacta tgggtgaaga agttcatgtt gcaagctggc ctgggatgtg gcgcggagga 600

gacgcggcaa tcggtgagag gatggtcgaa gcggatcttg gggcgccgtt tgtttgtgac 660

gccgaatttg cgatccgaga atatgcggca gagacaggaa atttcgttct aagcgcgtct 720

ggatattttc cgaaggacaa tatatccgat gagtggcgcg aagcgattcc aaaccttcaa 780

gcgcagtggg ctgtgggcgg gagttctatc gtggcaccgg ggggctccta tctggtccca 840

ccactcatta atgaggagaa gatcctctgc gccgaactcg atttcaatct caggcgtctt 900

tggaaagcct ggatcgatcc gattggtcac tattcgcgtc ccgatgttta tagcctgcaa 960

ctgcataacg ttgctgggcg tgagtattcc tatcaggccg tagatttgaa gcgcacgcca 1020

aagccccaat cgctgtgggt agatgcgtcc gaggaagacg gtgcgctgaa ttga 1074



<210> 154 
<211> 357 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
<400> 154 

Met Pro Asn Ala Arg Lys Ile Val Gly Ala Val Ala Gln Val Ala Gln
1 5 10 15

Glu Phe Phe Asp Thr Glu Ala Asn Leu Gly Lys Ala Ile Ala Ala Ile
20 25 30

His Asn Ala Ala Lys Gln Gly Ala Asp Ile Val Val Phe Ala Glu Cys
35 40 45

Tyr Leu Gly Gln Tyr Pro Tyr Trp Ala Gln Phe Tyr Asp Asn Ser Ala
50 55 60

Lys Asn Tyr Ser Lys Val Trp Thr Ala Leu Tyr Asp Gly Ala Ile Thr
65 70 75 80

Val Gly Gly Asp Glu Cys Arg Ala Ile Ala Ala Ala Ala Arg Gln Ser
85 90 95

Lys Ile His Val Val Met Gly Cys Asn Glu Leu Ser Asp Arg Ala Gly
100 105 110

Gly Ala Thr Leu Tyr Asn Ser Leu Leu Phe Phe Asp Arg Lys Gly Glu
115 120 125

Leu Ile Gly Arg His Arg Lys Leu Met Pro Ser Met His Glu Arg Leu
130 135 140

Ile His Gly Thr Gly Asp Gly Arg Asp Leu Asn Val Tyr Asp Thr Asp
145 150 155 160

Ile Gly Met Leu Gly Gly Leu Ile Cys Trp Glu His His Met Ser Leu
165 170 175

Ser Lys Tyr Ala Met Ala Thr Met Gly Glu Glu Val His Val Ala Ser
180 185 190

Trp Pro Gly Met Trp Arg Gly Gly Asp Ala Ala Ile Gly Glu Arg Met
195 200 205

Val Glu Ala Asp Leu Gly Ala Pro Phe Val Cys Asp Ala Glu Phe Ala
210 215 220

Ile Arg Glu Tyr Ala Ala Glu Thr Gly Asn Phe Val Leu Ser Ala Ser
225 230 235 240

Gly Tyr Phe Pro Lys Asp Asn Ile Ser Asp Glu Trp Arg Glu Ala Ile
245 250 255

Pro Asn Leu Gln Ala Gln Trp Ala Val Gly Gly Ser Ser Ile Val Ala
260 265 270

Pro Gly Gly Ser Tyr Leu Val Pro Pro Leu Ile Asn Glu Glu Lys Ile
275 280 285

Leu Cys Ala Glu Leu Asp Phe Asn Leu Arg Arg Leu Trp Lys Ala Trp
290 295 300

Ile Asp Pro Ile Gly His Tyr Ser Arg Pro Asp Val Tyr Ser Leu Gln
305 310 315 320

Leu His Asn Val Ala Gly Arg Glu Tyr Ser Tyr Gln Ala Val Asp Leu
325 330 335

Lys Arg Thr Pro Lys Pro Gln Ser Leu Trp Val Asp Ala Ser Glu Glu
340 345 350

Asp Gly Ala Leu Asn
355







<210> 155 
<211> 1041 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 155 

atgggcatcg aacatccgaa atacaaggtg gccgtggtgc aggccgcgcc ggcctggctc 60

gatctcgacg gctcgatcaa gaaggcgatt gcgctgatcg aggaagcggc cgccaagggc 120

gctaagctga tcgctttccc cgaaaccttc attcccggct atccctggca catctggctg 180

gactcgccgg cctgggcgat cggccgcggc tttgtgcagc gctacttcga taactcgctg 240

gcctacgaca gcccgcaagc cgaaaagctg cgcgccgcgg tcaagaaggc caagctcact 300

gccgtgattg gcctgtcgga gcgcgacggc ggcagcctct atatagcgca atggctgatt 360

ggccctgatg gcgagaccat cgcaaaacgc agaaagctgc ggccaacgca cgcggaacgc 420

accgtttttg gcgagggtga cggcagcgac cttgccgtgc acgaccggcc cggaatcggg 480

cggctgggag cgctgtgctg ctgggagcac ctgcaaccgc tttcgaaata cgcgatgtat 540

gcgcagaacg aacaggtcca tgtcgcgtca tggccgagct tctcgctcta cgaccccttc 600

gcgccggcgc tcggcgccga ggtcaacaat gcggcttccc gcgtctacgc ggtcgagggc 660

tcgtgcttcg tgctggcgcc gtgcgccacg gtttcgcaag ccatgatcga cgagctgtgt 720

gaccggccgg acaagcatgc gctgttgcac gccggtggcg gacacgccgc gatttacggc 780

ccggacggca gctcgatcgc ggagaagctg ccgcaggacg cggagggcct gttgatcgcc 840

gagatcgatc tcggggcgat cggggttgcc aagaatgcag ccgacccggc cggtcattat 900

tcgcggccgg acgtgacgcg actcctgctg aacaagaacc ggatgcgaag ggtcgaggag 960

tttgcgctgc cggtcgatcc ggtcgcaacg accgaggagg agcaagtcgc gacgccgtcg 1020

aggcccagcc aggccgcgta a 1041

<210> 156 
<211> 346 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 156 



Met Gly Ile Glu His Pro Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15

Pro Ala Trp Leu Asp Leu Asp Gly Ser Ile Lys Lys Ala Ile Ala Leu
20 25 30

Ile Glu Glu Ala Ala Ala Lys Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45

Thr Phe Ile Pro Gly Tyr Pro Trp His Ile Trp Leu Asp Ser Pro Ala
50 55 60

Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80

Ala Tyr Asp Ser Pro Gln Ala Glu Lys Leu Arg Ala Ala Val Lys Lys
85 90 95

Ala Lys Leu Thr Ala Val Ile Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110

Leu Tyr Ile Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125

Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Phe Gly
130 135 140

Glu Gly Asp Gly Ser Asp Leu Ala Val His Asp Arg Pro Gly Ile Gly
145 150 155 160

Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175

Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ser Trp Pro
180 185 190

Ser Phe Ser Leu Tyr Asp Pro Phe Ala Pro Ala Leu Gly Ala Glu Val
195 200 205

Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220

Leu Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225 230 235 240

Asp Arg Pro Asp Lys His Ala Leu Leu His Ala Gly Gly Gly His Ala
245 250 255

Ala Ile Tyr Gly Pro Asp Gly Ser Ser Ile Ala Glu Lys Leu Pro Gln
260 265 270

Asp Ala Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Ala Ile Gly
275 280 285

Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300

Val Thr Arg Leu Leu Leu Asn Lys Asn Arg Met Arg Arg Val Glu Glu
305 310 315 320

Phe Ala Leu Pro Val Asp Pro Val Ala Thr Thr Glu Glu Glu Gln Val
325 330 335

Ala Thr Pro Ser Arg Pro Ser Gln Ala Ala
340 345

<210> 157 
<211> 1011 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 157 

atgagagtcg ttaaagctgc tgcggtccaa ctgagtcccg tgctgtatag ccgtgaggga 60 

acagtagaaa aggtcgttcg gaagatccac gagcttggcg atcaaggagt cgagttcgcc 120 

acgttcccgg agaccgtagt gccctactat ccgtacttct cggccgtcca gacgccgatt 180 

cagaacatgc acggcccgga gcacctgaag ttgctcgagc aatcggtgac cgtcccgtcg 240

cccgccaccg acgcgatcgg cgacgcctgc cgccacgccg gcgtcgtcgt ctcgatcggc 300 

gtcaacgaac gcgatggcgg cacgatctac aacacgcagc tcctgttcga cgccgacggc 360 

accttgatcc agcgccggcg aaagatcacg ccgaccttct acgaacgaat ggtctgggga 420 

cagggtgacg gttcggggct gcgcgccgtc gacagccgcg taggacgcat cggccagctc 480

gcctgtttcg agcactacaa cccgctggcg cgctacgcca tgatggccga cggcgagcag 540

attcactccg cgatgtaccc cggctccatc tttggagacg cattcgcgca gaaaatcgag 600 

atcaacatcc gccagcacgc gctcgagtcc ggtgcgttcg tcgtcaacgc caccgcctgg 660 

ctcgatgccg accagcaggc gcggatcatg aaggataccg gctgcaccat cgaaccgatc 720

tcgggcggtt gcttcaccgc catcgtcacc ccggacggga ccctgctggg cgaagcgata 780 

cgttcggggg agggagtggt ggtcgccgat ctcgacttca cgctgatcga caggcgcaag 840

caagtgatgg actctcgtgg tcactacagt cggccggagt tgctcagcct tctgatcgac 900 

cgcacaccca ccgcacacct acacgaacgc gaagcgcacc ccagagcaag tgaggactgg 960 

caaggttccg agagtctgcg cgccatgcag gcctcggcac cgaaggtctg a 1011 



<210> 158 
<211> 336 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 158 



Met Arg Val Val Lys Ala Ala Ala Val Gln Leu Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Glu Gly Thr Val Glu Lys Val Val Arg Lys Ile His Glu Leu
20 25 30

Gly Asp Gln Gly Val Glu Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Ala Val Gln Thr Pro Ile Gln Asn Met His
50 55 60

Gly Pro Glu His Leu Lys Leu Leu Glu Gln Ser Val Thr Val Pro Ser
65 70 75 80

Pro Ala Thr Asp Ala Ile Gly Asp Ala Cys Arg His Ala Gly Val Val
85 90 95

Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Thr
100 105 110

Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125

Ile Thr Pro Thr Phe Tyr Glu Arg Met Val Trp Gly Gln Gly Asp Gly
130 135 140

Ser Gly Leu Arg Ala Val Asp Ser Arg Val Gly Arg Ile Gly Gln Leu
145 150 155 160

Ala Cys Phe Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Met Met Ala
165 170 175

Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Ile Phe Gly
180 185 190

Asp Ala Phe Ala Gln Lys Ile Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205

Glu Ser Gly Ala Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220

Gln Gln Ala Arg Ile Met Lys Asp Thr Gly Cys Thr Ile Glu Pro Ile
225 230 235 240

Ser Gly Gly Cys Phe Thr Ala Ile Val Thr Pro Asp Gly Thr Leu Leu
245 250 255

Gly Glu Ala Ile Arg Ser Gly Glu Gly Val Val Val Ala Asp Leu Asp
260 265 270

Phe Thr Leu Ile Asp Arg Arg Lys Gln Val Met Asp Ser Arg Gly His
275 280 285

Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Thr
290 295 300

Ala His Leu His Glu Arg Glu Ala His Pro Arg Ala Ser Glu Asp Trp
305 310 315 320

Gln Gly Ser Glu Ser Leu Arg Ala Met Gln Ala Ser Ala Pro Lys Val
325 330 335



<210> 159 
<211> 930 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 159 

atgtcatcaa ccgtgacggt tgccattatt caggcagcac ccgtgtatta tgacctgcct 60
gccacgctgg acaaagccgc caaactggtg gcggatgcgg cggcacaggg cgcaacgctg 120
attgtcttcg gcgagacatg gtttccgggg tatccggcat ggctggatta ctgccccaat 180
gtcgcgctgt ggaatcatcc cccgaccaag caggtatttg agcgcctgca tcgcaacagc 240
atcgctgtgc caagcaagga actcgatttt ctgggggcgc tggcacgcaa gcatcaggtg 300
gtgctggtgt tgagcattaa tgaacgtgtg gagcagggcg cggggcatgg cacgctgtat 360
aacacgctgc tcacgattga cgccgatggc acgctggcaa atcatcatcg caaactgatg 420
ccgacctata ccgagcgcat ggtgtggggc atgggcgacg gggtggggtt gcaagcggtg 480
gatactgccg tcgggcgcgt aggcggctta atctgctggg aacactggat gccgttggca 540
cgccagacca tgcacatcag cggcgaacag attcatattt ccgtcttccc aaccgtccat 600
gagatgcacc agattgccag ccgccagtat gcctttgaag ggcggacgtt tgtgctgacc 660
gttggcggca ttcttgcggc acaggacttg cccgccgaac tggaacgccc cgccgatttg 720
ccgcccacgc agcttgtcca gcgcggcggc agcgccatta tcgcgccgga tggtcgttat 780
ctggcgggtc cagtctataa tgaggaaacc atcctgaccg caacgctgga tttgggcgag 840
atcatccgcg agagcatgac gctggatgtc accggacatt atgcccgccc ggatgttttt 900
gacctgaccg tgaagcgcag ccgaccatga 930



<210> 160 
<211> 309 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 160 

Met Ser Ser Thr Val Thr Val Ala Ile Ile Gln Ala Ala Pro Val Tyr
1 5 10 15

Tyr Asp Leu Pro Ala Thr Leu Asp Lys Ala Ala Lys Leu Val Ala Asp
20 25 30

Ala Ala Ala Gln Gly Ala Thr Leu Ile Val Phe Gly Glu Thr Trp Phe
35 40 45

Pro Gly Tyr Pro Ala Trp Leu Asp Tyr Cys Pro Asn Val Ala Leu Trp
50 55 60

Asn His Pro Pro Thr Lys Gln Val Phe Glu Arg Leu His Arg Asn Ser
65 70 75 80

Ile Ala Val Pro Ser Lys Glu Leu Asp Phe Leu Gly Ala Leu Ala Arg
85 90 95

Lys His Gln Val Val Leu Val Leu Ser Ile Asn Glu Arg Val Glu Gln
100 105 110

Gly Ala Gly His Gly Thr Leu Tyr Asn Thr Leu Leu Thr Ile Asp Ala
115 120 125

Asp Gly Thr Leu Ala Asn His His Arg Lys Leu Met Pro Thr Tyr Thr
130 135 140

Glu Arg Met Val Trp Gly Met Gly Asp Gly Val Gly Leu Gln Ala Val
145 150 155 160

Asp Thr Ala Val Gly Arg Val Gly Gly Leu Ile Cys Trp Glu His Trp
165 170 175

Met Pro Leu Ala Arg Gln Thr Met His Ile Ser Gly Glu Gln Ile His
180 185 190

Ile Ser Val Phe Pro Thr Val His Glu Met His Gln Ile Ala Ser Arg
195 200 205

Gln Tyr Ala Phe Glu Gly Arg Thr Phe Val Leu Thr Val Gly Gly Ile
210 215 220

Leu Ala Ala Gln Asp Leu Pro Ala Glu Leu Glu Arg Pro Ala Asp Leu
225 230 235 240

Pro Pro Thr Gln Leu Val Gln Arg Gly Gly Ser Ala Ile Ile Ala Pro
245 250 255

Asp Gly Arg Tyr Leu Ala Gly Pro Val Tyr Asn Glu Glu Thr Ile Leu
260 265 270

Thr Ala Thr Leu Asp Leu Gly Glu Ile Ile Arg Glu Ser Met Thr Leu
275 280 285

Asp Val Thr Gly His Tyr Ala Arg Pro Asp Val Phe Asp Leu Thr Val
290 295 300

Lys Arg Ser Arg Pro
305

<210> 161 
<211> 1008 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 161

atjMclcca tccgcgccgc cgccgtgcag tttagcccgg tgctgtactc gcgccaggcc 60
accgtcgaca agctgtgccg caccctgctg gaactgggcc gcgaaggggt gcagttcgcg 120
gta?tcccgg aaaccgtggt gccgtactac ccatattttt ccttcgtgca Sccaccgttc 180
gccatgggca aacaacacct gttgctgctc gagcaatccg tcactgtgcc ctctgacgtc 240
Lccggcaga tcggtgaggc ctgccgggaa gcggggatcg tcgccagcat cggcgtcaac 300
gaacgcgacg gcggcactat ttataacgcg cagttgctgt tcgatgccga cggcagcctg 360
Ittcagcagc ggcgcaagat caccccgacc tatcacgaac gcatggtctg ggggcagggc 420
Jatgg^tccg gccigcgcgc cgtggacagt gcggtggggc gtatcggttc cctggcctgc 480
tgggaacatt acaaccccct ggcgcgctac gcgctgatgg ccgatggcga acagattcat 540
g?ggcgatgt ttcccggctc cctggtcggc gacatctttg ccgagcagat cgaagtcacc 600
atccgccacc acgccctgga aagcggctgc ttcgtggtca acgccacggc "ggctggat 660
gccgaccagc agggccggat catgcaggac accggctgcg agttggggcc ^"^cggc 720
aqctgtttta ccgcgatcat ttccccggag ggcaaggttc tcggcgagcc gctgcgcagc 780
agcgaagggg tggtcattgc tgacctcgac ctggccctga tcgacaagcg caaacgcatg 840
St?cgg tcggtcacta cagccgcccg gaactgctca gcctgcttat cgaccgcagc 900
ccgaccgccc acgtgcatga acttgccgcc gcgcttaatc ctgccaggga gtctgatcca 960
ctagtgtcga cctgcaggcg cgcgagctcc agcttttgtt ccctttag 1008

<210> 162
<211> 335 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 162 



Met Thr Thr Ile Arg Ala Ala Ala Val Gln Phe Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Gln Ala Thr Val Asp Lys Leu Cys Arg Thr Leu Leu Glu Leu
20 25 30

Gly Arg Glu Gly Val Gln Phe Ala Val Phe Pro Glu Thr Val Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Pro Pro Phe Ala Met Gly Lys
50 55 60

Gln His Leu Leu Leu Leu Glu Gln Ser Val Thr Val Pro Ser Asp Val
65 70 75 80

Thr Arg Gln Ile Gly Glu Ala Cys Arg Glu Ala Gly Ile Val Ala Ser
85 90 95

Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Ala Gln Leu
100 105 110

Leu Phe Asp Ala Asp Gly Ser Leu Ile Gln Gln Arg Arg Lys Ile Thr
115 120 125

Pro Thr Tyr His Glu Arg Met Val Trp Gly Gln Gly Asp Gly Ser Gly
130 135 140

Leu Arg Ala Val Asp Ser Ala Val Gly Arg Ile Gly Ser Leu Ala Cys
145 150 155 160

Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Asp Gly
165 170 175

Glu Gln Ile His Val Ala Met Phe Pro Gly Ser Leu Val Gly Asp Ile
180 185 190

Phe Ala Glu Gln Ile Glu Val Thr Ile Arg His His Ala Leu Glu Ser
195 200 205

Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp Gln Gln
210 215 220

Gly Arg Ile Met Gln Asp Thr Gly Cys Glu Leu Gly Pro Ile Ser Gly
225 230 235 240

Gly Cys Phe Thr Ala Ile Ile Ser Pro Glu Gly Lys Val Leu Gly Glu
245 250 255

Pro Leu Arg Ser Gly Glu Gly Val Val Ile Ala Asp Leu Asp Leu Ala
260 265 270

Leu Ile Asp Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr Ser
275 280 285

Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Ser Pro Thr Ala His
290 295 300

Val His Glu Leu Ala Ala Ala Leu Asn Pro Ala Arg Glu Ser Asp Pro
305 310 315 320

Leu Val Ser Thr Cys Arg Arg Ala Ser Ser Ser Phe Cys Ser Leu
325 330 335





<210> 163 
<211> 978 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample

<400> 163

gtgaccatca tcaaagccgc cgcagtgcag atcagccccg tgctttacag ccgggaagcc 60 

accgtcgaaa aggtcgttcg cgagacccgc gaactcggcc agaagggcgt gcagttcgca 120 

acgtttccgg aaaccgtggt gccgtactac ccatacttct ccgccgtcca gacgggcatc 180

gaactgctgt ccggcaaaga gcacctgcga ctgctggagc aggccgtgac tgttccttcc 240

cccgccactg atgcgattgc ccaggcggca cgcgaggccg gcatggtggt gtcgatcggc 300 

gtcaacgagc gtgacggcgg caccatctac aacacgcagc tgctctttga tgccgacggc 360 

acgctggtgc agcgccgccg caagatcacg ccgacgcatt tcgagcgcat ggtgtggggc 420

cagggcgacg gttcgggcct gcgcgcagtg gataccaagg tcggccgcat tggccagctg 480 

gcctgcttcg agcacaacaa cccgctcgcg cgctacgcaa tgatggccga tggcgagcag 540 

atccattcct ccatgtaccc gggctccgcc ttcggcgacg gattcgcgca gcgcatggag 600 

atcaacattc gccaacacgc cctggagtcg ggttgcttcg tggtgaatgc caccgcgtgg 660 

ctcgacgccg accagcaggc gcagatcatg aaggacacgg gctgcgccat cgggccgatc 720

tctggcggct gcttcacgac catcgtcacg ccggacggca tgctgatcgg cgaacccctc 780

cgcgagggcg agggcgagat catcgccgac ctcgatttca ccctgatcga ccgccgcaag 840

ctgctgatgg actcggtcgg ccactacaac cgtccggagc tgctgagcct gctgatcgac 900 

cgcacacccg cggcgaactt ccatgagcgc agtacgcatc cggccgtcga tgccgccagc 960 

ggcctcgaaa tcctctaa 978 



<210> 164 
<211> 325 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 164 



Val Thr Ile Ile Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Glu Ala Thr Val Glu Lys Val Val Arg Glu Thr Arg Glu Leu
20 25 30

Gly Gln Lys Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Ala Val Gln Thr Gly Ile Glu Leu Leu Ser
50 55 60

Gly Lys Glu His Leu Arg Leu Leu Glu Gln Ala Val Thr Val Pro Ser
65 70 75 80

Pro Ala Thr Asp Ala Ile Ala Gln Ala Ala Arg Glu Ala Gly Met Val
85 90 95

Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Thr
100 105 110

Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Val Gln Arg Arg Arg Lys
115 120 125

Ile Thr Pro Thr His Phe Glu Arg Met Val Trp Gly Gln Gly Asp Gly
130 135 140

Ser Gly Leu Arg Ala Val Asp Thr Lys Val Gly Arg Ile Gly Gln Leu
145 150 155 160

Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met Met Ala
165 170 175

Asp Gly Glu Gln Ile His Ser Ser Met Tyr Pro Gly Ser Ala Phe Gly
180 185 190

Asp Gly Phe Ala Gln Arg Met Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205

Glu Ser Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220

Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Ala Ile Gly Pro Ile
225 230 235 240

Ser Gly Gly Cys Phe Thr Thr Ile Val Thr Pro Asp Gly Met Leu Ile
245 250 255

Gly Glu Pro Leu Arg Glu Gly Glu Gly Glu Ile Ile Ala Asp Leu Asp
260 265 270

Phe Thr Leu Ile Asp Arg Arg Lys Leu Leu Met Asp Ser Val Gly His
275 280 285

Tyr Asn Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Ala
290 295 300

Ala Asn Phe His Glu Arg Ser Thr His Pro Ala Val Asp Ala Ala Ser
305 310 315 320

Gly Leu Glu Ile Leu
325

<210> 165 
<211> 1008 
<212> DNA
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 165 

atggccaatt tcaaattcaa ggcggcggcg gtgcaggccg cgcccgcttt cctcgatctc 60 

gaggctagca tcgccaagtc gatcgccctg atcgaacaag ccgccgccaa cggcgccaag 120 

ctgatcgcct ttcccgaagt cttcattccc ggctacccct ggcacatctg gctcgacagt 180 

cccgcctggg cgatcgggcg cggcttcgtc tcgcgctatt tcgagaactc gctggactac 240 

aacagccccg aggccgagcg cctcaggctc gccgtcaaga aggcgggcct gacggcggtg 300 

atcggcctct ccgagcgcga cggcggcagc ctctacatcg cgcaatggat catcggccct 360 

gacggcgaga ccgttgcgaa acggcgtaag ctccggccga cccattgcga gcgcacggtc 420 

tatggagaag gcgacggcag cgacctcgcg gttcacgacg tatctggcat cggccgtctc 480

ggcgcgctct gctgctggga gcatatccag ccgctgtcga aattcgcgat gtattcgcaa 540

aatgagcaag tgcacgtcgc gtcctggccg agcttctcgc tctacgaccc gttcgcgccg 600 

gcgctgggcg ccgaggtcaa caacgcagcc tcgcggatct atgcggtcga aggctcatgc 660 

ttcgtcattg cgccctgcgc gaccgtttcg cctgcaatga tcgaggaact gtgcgacgcg 720

ccaaacaaac atgcgcttct gcacgcgggc ggcggcttcg cgcgcatcta tgggccggac 780 

ggcgcttcga tcgccgagac gctgccgcca gatcaggaag gcttgatcta cgccgacatc 840

gacctcaccg cgatcggcgt cgccaaggcc gccgccgatc ccgccggcca ttattcgcgc 900 

cccgacgtca cgcgcctgct cttcaacaag aagcccgctc ggcgagtcga aacttttgct 960 

ttgcccgtcg atgcgccggc gccggagacg cagaccgccg cgagctga 1008 



<210> 166 
<211> 335 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 166 



Met Ala Asn Phe Lys Phe Lys Ala Ala Ala Val Gln Ala Ala Pro Ala
1 5 10 15

Phe Leu Asp Leu Glu Ala Ser Ile Ala Lys Ser Ile Ala Leu Ile Glu
20 25 30

Gln Ala Ala Ala Asn Gly Ala Lys Leu Ile Ala Phe Pro Glu Val Phe
35 40 45

Ile Pro Gly Tyr Pro Trp His Ile Trp Leu Asp Ser Pro Ala Trp Ala
50 55 60

Ile Gly Arg Gly Phe Val Ser Arg Tyr Phe Glu Asn Ser Leu Asp Tyr
65 70 75 80

Asn Ser Pro Glu Ala Glu Arg Leu Arg Leu Ala Val Lys Lys Ala Gly
85 90 95

Leu Thr Ala Val Ile Gly Leu Ser Glu Arg Asp Gly Gly Ser Leu Tyr
100 105 110

Ile Ala Gln Trp Ile Ile Gly Pro Asp Gly Glu Thr Val Ala Lys Arg
115 120 125

Arg Lys Leu Arg Pro Thr His Cys Glu Arg Thr Val Tyr Gly Glu Gly
130 135 140

Asp Gly Ser Asp Leu Ala Val His Asp Val Ser Gly Ile Gly Arg Leu
145 150 155 160

Gly Ala Leu Cys Cys Trp Glu His Ile Gln Pro Leu Ser Lys Phe Ala
165 170 175

Met Tyr Ser Gln Asn Glu Gln Val His Val Ala Ser Trp Pro Ser Phe
180 185 190

Ser Leu Tyr Asp Pro Phe Ala Pro Ala Leu Gly Ala Glu Val Asn Asn
195 200 205

Ala Ala Ser Arg Ile Tyr Ala Val Glu Gly Ser Cys Phe Val Ile Ala
210 215 220

Pro Cys Ala Thr Val Ser Pro Ala Met Ile Glu Glu Leu Cys Asp Ala
225 230 235 240

Pro Asn Lys His Ala Leu Leu His Ala Gly Gly Gly Phe Ala Arg Ile
245 250 255

Tyr Gly Pro Asp Gly Ala Ser Ile Ala Glu Thr Leu Pro Pro Asp Gln
260 265 270

Glu Gly Leu Ile Tyr Ala Asp Ile Asp Leu Thr Ala Ile Gly Val Ala
275 280 285

Lys Ala Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val Thr
290 295 300

Arg Leu Leu Phe Asn Lys Lys Pro Ala Arg Arg Val Glu Thr Phe Ala
305 310 315 320

Leu Pro Val Asp Ala Pro Ala Pro Glu Thr Gln Thr Ala Ala Ser
325 330 335





<210> 167 
<211> 1017 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 167 

atgggtattg aacatccgaa gtacagggtt gccgtggtgc aggccgcacc ggcctggctc 60
gatcttgacg cgtcgatcga caagtcgatc gcgctgatcg aggaggctgc ccagaaaggc 120
gccaagctga tcgcattccc cgaggccttc atccccggct acccctggca tatctggatg 180
gactcgcccg cctgggcgat tggccgcggt tttgtgcagc gctacttcga caattcgctg 240
gcctatgaca gcccgcaggc cgagaagctg cgcgcggccg tgcgcaaggc aaaactcacg 300
gccgtgatcg gcttgtcgga gcgtgacggc ggcagccttt atctcgcaca atggctgatc 360
ggccccgacg gcgagaccat cgcaaaacgg cgcaagctgc ggccgacaca tgccgagcgc 420
actgtgtacg gcgagggcga cggcagcgac cttgcggtcc acaatcgtcc ggacatcggc 480
aggctcggtg cgctctgctg ctgggagcat cttcagccac tgtcgaaata cgcgatgtac 540
gcgcagaacg agcaggtgca cgtcgcggcc tggccgagct tttcgctcta cgatcccttc 600
gccgtggcgc tcggcgccga ggtgaacaac gcggcctccc gcgtctatgc ggtcgaaggc 660
tcctgcttcg tgctggcgcc gtgcgcgaca gtctcgcaag ccatgatcga cgagctctgc 720
gatcggccgg acaagcacgc gctgctgcat gtcggcggcg gctttgccgc gatctacggg 780
cccgacggca gccagatcgg cgacaagctc gcccccgacc aggagggcct gttgatcgcc 840
gagatcgatc tcggcgccat aggtgtcgcc aagaacgccg cggatcccgc cgggcactat 900
tcgcggcccg acgtgacgcg gctgttgctc aacaagaaac cgtacaagcg cgtcgaacag 960
ttctcgccgc cgtcggaggc ggttgaaccc acggatatcg cggcggcggc aagctga 1017



<210> 168 
<211> 338 
<212> PRT 
<213> Unknown 



<220>

<223> Obtained from an environmental sample



<400> 168 



Met Gly Ile Glu His Pro Lys Tyr Arg Val Ala Val Val Gln Ala Ala
1 5 10 15

Pro Ala Trp Leu Asp Leu Asp Ala Ser Ile Asp Lys Ser Ile Ala Leu
20 25 30

Ile Glu Glu Ala Ala Gln Lys Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45

Ala Phe Ile Pro Gly Tyr Pro Trp His Ile Trp Met Asp Ser Pro Ala
50 55 60

Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80

Ala Tyr Asp Ser Pro Gln Ala Glu Lys Leu Arg Ala Ala Val Arg Lys
85 90 95

Ala Lys Leu Thr Ala Val Ile Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110

Leu Tyr Leu Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125

Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly
130 135 140

Glu Gly Asp Gly Ser Asp Leu Ala Val His Asn Arg Pro Asp Ile Gly
145 150 155 160

Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175

Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
180 185 190

Ser Phe Ser Leu Tyr Asp Pro Phe Ala Val Ala Leu Gly Ala Glu Val
195 200 205

Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220

Leu Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225 230 235 240

Asp Arg Pro Asp Lys His Ala Leu Leu His Val Gly Gly Gly Phe Ala
245 250 255

Ala Ile Tyr Gly Pro Asp Gly Ser Gln Ile Gly Asp Lys Leu Ala Pro
260 265 270

Asp Gln Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Ala Ile Gly
275 280 285

Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300

Val Thr Arg Leu Leu Leu Asn Lys Lys Pro Tyr Lys Arg Val Glu Gln
305 310 315 320

Phe Ser Pro Pro Ser Glu Ala Val Glu Pro Thr Asp Ile Ala Ala Ala
325 330 335

Ala Ser



<210> 169 
<211> 1077 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 169 

atggccctca cccatccgaa attgaaagtc gccgccgtgc aggcagctcc cgcgttcctc 60
gatgtcgatg ccgcagtgga caaagcggtg cggctaatcg acgaagcggc agcaaacggc 120
tccagtctgg tggcattccc cgagacctgg atccccggct atccgttttg gatctggctt 180
ggctcgccgg cctgggcaat catgcgcggg tttgtgtctc gctatttcga taattcgctc 240
agctatgaca gccggcaggc agagcgcctg cgcgacgccg cgaagcgcca caaactgacc 300
gtcgtcatgg gcctgtccga gcgcgccggc ggtagccttt acatcgcgca gtggatcatt 360
ggtcccaatg gcgagaccgt cgcacagcgg cgcaagctca agcccaccca tgcggagcgc 420
accgtcttcg gcgagggtga cggcagccac ctggcggtac acaatcttcc aatcggacgg 480
ctcggtgcgc tgtgctgctg ggagcacctc cagccgctct ccaaatacgc gatgtacgcc 540
cagaacgaag agatccacgt ggcggcatgg ccgtccttct cgctctacga cccgtttgcg 600
cacgcgctcg gcgccgaagt caacaacgca gcgagccaga tctacgcggt tgaaggttcc 660
tgctttgtcg tcgcgccatg tgcggtgatc tcgcaggaaa tgatcgatct tatgtgcgat 720
acccccgaca agcatcagct tattcacgtc ggtggcggct tcaccgtgat ctatggcccg 780
gacggtgcgc gcatcggcga caagctcgcg ccagatcagg aaggcattgt ctatgccgac 840
atcgatctcg gcatgatccc gatcgcgaaa gctgccgccg atcctgccgg ccactatgcg 900
cgacccgacg ttacccgcct tctgttcaac aatcgtcccg ccaatcgggt ggaaaccctc 960
gtgctccccg ttgatcaggt ccgtgacatc gatgcacgtg tggaggccgc ggcacctcag 1020
gcgcgaccag caaccgggaa cgaggatccc gccgcaaagc ctatggccgc cgaatga 1077



<210> 170 
<211> 358 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 170 



Met Ala Leu Thr His Pro Lys Leu Lys Val Ala Ala Val Gln Ala Ala
1 5 10 15

Pro Ala Phe Leu Asp Val Asp Ala Ala Val Asp Lys Ala Val Arg Leu
20 25 30

Ile Asp Glu Ala Ala Ala Asn Gly Ser Ser Leu Val Ala Phe Pro Glu
35 40 45

Thr Trp Ile Pro Gly Tyr Pro Phe Trp Ile Trp Leu Gly Ser Pro Ala
50 55 60

Trp Ala Ile Met Arg Gly Phe Val Ser Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80

Ser Tyr Asp Ser Arg Gln Ala Glu Arg Leu Arg Asp Ala Ala Lys Arg
85 90 95

His Lys Leu Thr Val Val Met Gly Leu Ser Glu Arg Ala Gly Gly Ser
100 105 110

Leu Tyr Ile Ala Gln Trp Ile Ile Gly Pro Asn Gly Glu Thr Val Ala
115 120 125

Gln Arg Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Val Phe Gly
130 135 140

Glu Gly Asp Gly Ser His Leu Ala Val His Asn Leu Pro Ile Gly Arg
145 150 155 160

Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr
165 170 175

Ala Met Tyr Ala Gln Asn Glu Glu Ile His Val Ala Ala Trp Pro Ser
180 185 190

Phe Ser Leu Tyr Asp Pro Phe Ala His Ala Leu Gly Ala Glu Val Asn
195 200 205

Asn Ala Ala Ser Gln Ile Tyr Ala Val Glu Gly Ser Cys Phe Val Val
210 215 220

Ala Pro Cys Ala Val Ile Ser Gln Glu Met Ile Asp Leu Met Cys Asp
225 230 235 240

Thr Pro Asp Lys His Gln Leu Ile His Val Gly Gly Gly Phe Thr Val
245 250 255

Ile Tyr Gly Pro Asp Gly Ala Arg Ile Gly Asp Lys Leu Ala Pro Asp
260 265 270

Gln Glu Gly Ile Val Tyr Ala Asp Ile Asp Leu Gly Met Ile Pro Ile
275 280 285

Ala Lys Ala Ala Ala Asp Pro Ala Gly His Tyr Ala Arg Pro Asp Val
290 295 300

Thr Arg Leu Leu Phe Asn Asn Arg Pro Ala Asn Arg Val Glu Thr Leu
305 310 315 320

Val Leu Pro Val Asp Gln Val Arg Asp Ile Asp Ala Arg Val Glu Ala
325 330 335

Ala Ala Pro Gln Ala Arg Pro Ala Thr Gly Asn Glu Asp Pro Ala Ala
340 345 350

Lys Pro Met Ala Ala Glu
355


<210> 171 
<211> 1011 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 171 

atgactaagg gaacagtgaa agtcgcggcg gcgcaggtca cccccgtgtt catggatcgt 60 

aaagccacga tcgtcaaggc ctgcgacacg atcgccgagg cgggcaagaa cggcgcgcgg 120 

ctggtggtgt tcccggagac gtttgttcct ggctacccgg actgggtctg gacggcgacg 180

gctgggacgc atcgcgatat ccaccaggcg atgtacgcgg aactgctgga ccaggctgtc 240

tcgattccga gcccggcgac ggacgccctc tgccgtgctg caaagaaggc gggcgtctac 300 

gtcgtcatcg gcgtcaatga gctgagtggg ccgggcggaa gcctgtacaa cacgctgatc 360 

tacatcgatg acgaaggcga gatcatgggc cgccaccgca agctggtccc cacgatgggc 420 

gagcgcctgg tctgggcacc cggcgacggc agcacgctgg aggcgtacga gacatcgatc 480

ggcaggctgg gcggactgat ctgctgggag aactacatgc cgctggcccg ctacgccatg 540 

tacgcctggg gcgtgcagat ctacgtcgcg ccgacgtggg acagctcgga cgggtgggtt 600 

ggcagcatgc agcacatcgc ccgcgaaggg cggacggcgg tgatcggctg ctgcatggcg 660 

atccgtcgca gcgacatccc ggacaagtac gagttcaaga agctgtaccc gccgagcaag 720

agcaaagacg aagaatgggt gaacgatggc aacagcgtca tcgtcgcacc cggtggacga 780

atactcgccg ggccggtcgc caaagaggag acgatcctct acgccgatct ggacccggca 840 

gccgagcgcg gttcaaagtt ctcgttagat gtggcagggc actacgcgcg gccggacgtc 900 

ttccagctga cggtgaatcg cggtccggca gaactggtga atgtggccgg tgatatcgca 960 

ccggcaacca acggcaaagt caaaacaccg gcgaaattac gccgcaagta a 1011 

<210> 172 
<211> 336 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 172 



Met Thr Lys Gly Thr Val Lys Val Ala Ala Ala Gln Val Thr Pro Val
1 5 10 15

Phe Met Asp Arg Lys Ala Thr Ile Val Lys Ala Cys Asp Thr Ile Ala
20 25 30

Glu Ala Gly Lys Asn Gly Ala Arg Leu Val Val Phe Pro Glu Thr Phe
35 40 45

Val Pro Gly Tyr Pro Asp Trp Val Trp Thr Ala Thr Ala Gly Thr His
50 55 60

Arg Asp Ile His Gln Ala Met Tyr Ala Glu Leu Leu Asp Gln Ala Val
65 70 75 80

Ser Ile Pro Ser Pro Ala Thr Asp Ala Leu Cys Arg Ala Ala Lys Lys
85 90 95

Ala Gly Val Tyr Val Val Ile Gly Val Asn Glu Leu Ser Gly Pro Gly
100 105 110

Gly Ser Leu Tyr Asn Thr Leu Ile Tyr Ile Asp Asp Glu Gly Glu Ile
115 120 125

Met Gly Arg His Arg Lys Leu Val Pro Thr Met Gly Glu Arg Leu Val
130 135 140

Trp Ala Pro Gly Asp Gly Ser Thr Leu Glu Ala Tyr Glu Thr Ser Ile
145 150 155 160

Gly Arg Leu Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met Pro Leu Ala
165 170 175

Arg Tyr Ala Met Tyr Ala Trp Gly Val Gln Ile Tyr Val Ala Pro Thr
180 185 190

Trp Asp Ser Ser Asp Gly Trp Val Gly Ser Met Gln His Ile Ala Arg
195 200 205

Glu Gly Arg Thr Ala Val Ile Gly Cys Cys Met Ala Ile Arg Arg Ser
210 215 220

Asp Ile Pro Asp Lys Tyr Glu Phe Lys Lys Leu Tyr Pro Pro Ser Lys
225 230 235 240

Ser Lys Asp Glu Glu Trp Val Asn Asp Gly Asn Ser Val Ile Val Ala
245 250 255

Pro Gly Gly Arg Ile Leu Ala Gly Pro Val Ala Lys Glu Glu Thr Ile
260 265 270

Leu Tyr Ala Asp Leu Asp Pro Ala Ala Glu Arg Gly Ser Lys Phe Ser
275 280 285

Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gln Leu Thr
290 295 300

Val Asn Arg Gly Pro Ala Glu Leu Val Asn Val Ala Gly Asp Ile Ala
305 310 315 320

Pro Ala Thr Asn Gly Lys Val Lys Thr Pro Ala Lys Leu Arg Arg Lys
325 330 335



<210> 173 

<211> 993 

<212> DNA

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 173 

atgaaagttg ttaaagccgc cgcagtccaa ctgagtcccg tcctctatag ccgcgaggga 60 

acggtcgaga gggtcgttcg gaagatccac gagcttggcc ggcagggggt acagttcgcc 120 

accttcccgg agaccgtagt gccttactac ccgtactttt ccttcgtcca gacgccctta 180 

cagattatag ccggacctga gcatctaaag ctgctcgacc aggcagtgac cgtgccgtcc 240 

cctgccaccg acgctatcag cgaggctgcc aggcaggcgg gagttgtggt gtccataggc 300 

gtcaacgagc gtgacggcgg aaccctgtac aacacgcagc tgctcttcga tgccgatggc 360 

gccttgatcc agcgccgccg caagattacg cccactcatt tcgagcgcat gatctggggc 420

cagggcgacg ggtcgggcct gcgcgctgtc gacagcaagg tcggtcgcat tggccagctc 480

gcatgctggg agcacaacaa ccccctggcg cgctacgcga tgatagccga cggcgagcag 540

atccattcgg caatgtatcc gggctccatg ttcggcgacc cgtttgccca gaagacggaa 600 

atcaatatcc ggcagcatgc attggagtct gcgtgcttcg tcgtgtgcgc cacggcctgg 660 

ctggacgccg atcagcaggc gcaaatctgc aaggacactg gctgcgacat cggcccgatc 720 

tccggcggtt gcttcaccgc gatcgtggcg cctgatggaa ccttgctggg cgagcccatc 780

cgctcgggcg aaggcatggt catcgtcgac ctcgacttca cgctcatcga caagcgcaag 840 

caggtgatgg actcgcgcgg ccactacaac cggccggaat tgctcagtct cctgatcgac 900 

cgcacaccca ctgcgcatgt tcacgaccgc gctgtgcgcc ccgagtcagc cgcggagcaa 960 

cgttcggagg aacttctcgc tacggctgtc taa 993 

<210> 174 
<211> 330 
<212> PRT 
<213> Unknown 

<220> 
<223> Obtained from an environmental sample



<400> 174 
Met Lys Val Val Lys Ala Ala Ala Val Gln Leu Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Glu Gly Thr Val Glu Arg Val Val Arg Lys Ile His Glu Leu
20 25 30

Gly Arg Gln Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Thr Pro Leu Gln Ile Ile Ala
50 55 60

Gly Pro Glu His Leu Lys Leu Leu Asp Gln Ala Val Thr Val Pro Ser
65 70 75 80

Pro Ala Thr Asp Ala Ile Ser Glu Ala Ala Arg Gln Ala Gly Val Val
85 90 95

Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr
100 105 110

Gln Leu Leu Phe Asp Ala Asp Gly Ala Leu Ile Gln Arg Arg Arg Lys
115 120 125

Ile Thr Pro Thr His Phe Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140

Ser Gly Leu Arg Ala Val Asp Ser Lys Val Gly Arg Ile Gly Gln Leu
145 150 155 160

Ala Cys Trp Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met Ile Ala
165 170 175

Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Met Phe Gly
180 185 190

Asp Pro Phe Ala Gln Lys Thr Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205

Glu Ser Ala Cys Phe Val Val Cys Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220

Gln Gln Ala Gln Ile Cys Lys Asp Thr Gly Cys Asp Ile Gly Pro Ile
225 230 235 240

Ser Gly Gly Cys Phe Thr Ala Ile Val Ala Pro Asp Gly Thr Leu Leu
245 250 255

Gly Glu Pro Ile Arg Ser Gly Glu Gly Met Val Ile Val Asp Leu Asp
260 265 270

Phe Thr Leu Ile Asp Lys Arg Lys Gln Val Met Asp Ser Arg Gly His
275 280 285

Tyr Asn Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Thr
290 295 300

Ala His Val His Asp Arg Ala Val Arg Pro Glu Ser Ala Ala Glu Gln
305 310 315 320

Arg Ser Glu Glu Leu Leu Ala Thr Ala Val
325 330



<210> 175 
<211> 945 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 175 

atgaccacct acgggcagtt tagacttgcc gcgatcaatg ccgcgcctgt ctacttcgac 60

agggaagcat ccaccgaaaa agcttgccgg ctcattctcg aagcgggggc atcaggggcg 120

acgctggcgg cgtttggcga gacgtggctt cccggctatc cattccacat ctggagggaa 180

gttccgactg ctctccgagt ggaatacatc gctaatgccg tcgagattcc cagcccaacg 240

accgaccgat tgtgcgcggc ggctcgtcag gcgaacatcg atgttgtgat cggcgttgtc 300

gaactggatg cgcagacaca cgggacggtc tactgtacgc tcttgttcat tggcagcgat 360

ggctcaattc tgggacgtca tcgaaagatt aaaccgactt tcgtggagcg aaccgcatgg 420

ggggaaggtg acggcagcag cctgatcgtc tacgagcgcc cgtatggcaa gatcagtggt 480

ctgtgttgct gggaacacaa tatggttctg ccgggctacg cgctgatggc gcaggggacg 540 

cagattcata tcgccgcatg gcccggctgg gaaagcactc gccatctgct cttatcaaga 600 

gcattcgctt ctcaggcagc ggcgtatgtg attgatgtag gcgctatcgt caatcgtgac 660 

gaccttcggg aagattacca ggctttgatt gctggaagct actggggcgg aagttgcatc 720 

atcaacccag aaggcgaggt catcgctggt ccagcgaaat cggagaccat tctggttgca 780 

gattgctcaa ccgagcagat ctttagctca aaagtgctct gtgatgtggg cgggcattat 840 

tctcgcccgg atatttttca gctccatgtc aatcgaaagc catatcaacg tatcgtcgag 900 

acgaacaacc cacaccccgc tccgattgag ttcgattacc gttga 945 



<210> 176 
<211> 314 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 176 



Met Thr Thr Tyr Gly Gln Phe Arg Leu Ala Ala Ile Asn Ala Ala Pro
1 5 10 15

Val Tyr Phe Asp Arg Glu Ala Ser Thr Glu Lys Ala Cys Arg Leu Ile
20 25 30

Leu Glu Ala Gly Ala Ser Gly Ala Thr Leu Ala Ala Phe Gly Glu Thr
35 40 45

Trp Leu Pro Gly Tyr Pro Phe His Ile Trp Arg Glu Val Pro Thr Ala
50 55 60

Leu Arg Val Glu Tyr Ile Ala Asn Ala Val Glu Ile Pro Ser Pro Thr
65 70 75 80

Thr Asp Arg Leu Cys Ala Ala Ala Arg Gln Ala Asn Ile Asp Val Val
85 90 95

Ile Gly Val Val Glu Leu Asp Ala Gln Thr His Gly Thr Val Tyr Cys
100 105 110

Thr Leu Leu Phe Ile Gly Ser Asp Gly Ser Ile Leu Gly Arg His Arg
115 120 125

Lys Ile Lys Pro Thr Phe Val Glu Arg Thr Ala Trp Gly Glu Gly Asp
130 135 140

Gly Ser Ser Leu Ile Val Tyr Glu Arg Pro Tyr Gly Lys Ile Ser Gly
145 150 155 160

Leu Cys Cys Trp Glu His Asn Met Val Leu Pro Gly Tyr Ala Leu Met
165 170 175

Ala Gln Gly Thr Gln Ile His Ile Ala Ala Trp Pro Gly Trp Glu Ser
180 185 190

Thr Arg His Leu Leu Leu Ser Arg Ala Phe Ala Ser Gln Ala Ala Ala
195 200 205

Tyr Val Ile Asp Val Gly Ala Ile Val Asn Arg Asp Asp Leu Arg Glu
210 215 220

Asp Tyr Gln Ala Leu Ile Ala Gly Ser Tyr Trp Gly Gly Ser Cys Ile
225 230 235 240

Ile Asn Pro Glu Gly Glu Val Ile Ala Gly Pro Ala Lys Ser Glu Thr
245 250 255

Ile Leu Val Ala Asp Cys Ser Thr Glu Gln Ile Phe Ser Ser Lys Val
260 265 270

Leu Cys Asp Val Gly Gly His Tyr Ser Arg Pro Asp Ile Phe Gln Leu
275 280 285

His Val Asn Arg Lys Pro Tyr Gln Arg Ile Val Glu Thr Asn Asn Pro
290 295 300

His Pro Ala Pro Ile Glu Phe Asp Tyr Arg
305 310
<210> 177
<211> 948 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 177 

atggactcac ttaccgttgg tctcgcgcaa atcgcgcccg tttggttgaa tcgggcgggg 60

acgttgtcaa agatgttgga acaggttcga gcggcgaaag aggcgggctg tcagcttgtt 120

gtgtttggtg aggcgttgct ccccggttat ccattttgga tcgaactgac gaacggcgca 180

gtcttcaatt cgccgatgca aaaggaaatc cacgcgcact acatggatca agctgtgcag 240

atcgaagcag ggcatcttga tccattgtgc ggcgcggcaa aagcgcacgg catcaccgtg 300

gtcgcgggca tcatcgagcg tccgttggat cgcggcggac atagtttata tgcgagtctg 360

gtgtatatcg atttgaacgg tgtcatccaa tcggtgcatc gcaaactgat gcccacctat 420

gaagaacgac tcacctggtc gcctggcgat ggtcatgggt tacgcgtgca tacactgggc 480

gcttttacgg ttggcaaact caattgttgg gaaaactgga tgccgctgcc gcgcgcggct 540

ctgtatgcgc aaggcgaaga tctgcacgtt gctgtctggc ccgggtccgt gcgcaacaca 600

caggatatta cgcgctttat cgcaatggag tcgcgatcgt ttgtcgtttc ggtttcgagt 660

ttgatgcgca agagtgactt cccacaagat acgcctcatc tctccgccat tcttgaatct 720

gcacccgatc cactcgccaa cggaggttcg tgtctggctg gacctgacgg taaatggatc 780

gttgaaccgg ttgcggatga agagaagttg atcgtcgcca ccattgacca tgcccgtgta 840

cgtgaagaac gccagaactt tgatccatcc ggtcattaca gccgaccaga tgtgacacaa 900

ttgagagtca accgccagcg acaaagcgtt atcgcttttg atgagtag 948

<210> 178
<211> 315
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample
<400> 178

Met Asp Ser Leu Thr Val Gly Leu Ala Gln Ile Ala Pro Val Trp Leu
1 5 10 15

Asn Arg Ala Gly Thr Leu Ser Lys Met Leu Glu Gln Val Arg Ala Ala
20 25 30

Lys Glu Ala Gly Cys Gln Leu Val Val Phe Gly Glu Ala Leu Leu Pro
35 40 45

Gly Tyr Pro Phe Trp Ile Glu Leu Thr Asn Gly Ala Val Phe Asn Ser
50 55 60

Pro Met Gln Lys Glu Ile His Ala His Tyr Met Asp Gln Ala Val Gln
65 70 75 80

Ile Glu Ala Gly His Leu Asp Pro Leu Cys Gly Ala Ala Lys Ala His
85 90 95

Gly Ile Thr Val Val Ala Gly Ile Ile Glu Arg Pro Leu Asp Arg Gly
100 105 110

Gly His Ser Leu Tyr Ala Ser Leu Val Tyr Ile Asp Leu Asn Gly Val
115 120 125

Ile Gln Ser Val His Arg Lys Leu Met Pro Thr Tyr Glu Glu Arg Leu
130 135 140

Thr Trp Ser Pro Gly Asp Gly His Gly Leu Arg Val His Thr Leu Gly
145 150 155 160

Ala Phe Thr Val Gly Lys Leu Asn Cys Trp Glu Asn Trp Met Pro Leu
165 170 175

Pro Arg Ala Ala Leu Tyr Ala Gln Gly Glu Asp Leu His Val Ala Val
180 185 190

Trp Pro Gly Ser Val Arg Asn Thr Gln Asp Ile Thr Arg Phe Ile Ala
195 200 205

Met Glu Ser Arg Ser Phe Val Val Ser Val Ser Ser Leu Met Arg Lys
210 215 220

Ser Asp Phe Pro Gln Asp Thr Pro His Leu Ser Ala Ile Leu Glu Ser
225 230 235 240

Ala Pro Asp Pro Leu Ala Asn Gly Gly Ser Cys Leu Ala Gly Pro Asp
245 250 255

Gly Lys Trp Ile Val Glu Pro Val Ala Asp Glu Glu Lys Leu Ile Val
260 265 270

Ala Thr Ile Asp His Ala Arg Val Arg Glu Glu Arg Gln Asn Phe Asp
275 280 285

Pro Ser Gly His Tyr Ser Arg Pro Asp Val Thr Gln Leu Arg Val Asn
290 295 300

Arg Gln Arg Gln Ser Val Ile Ala Phe Asp Glu
305 310 315













<210> 179 
<211> 915 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 179 

atgaccacca aagcagccat cattcaggcg cgccccatat actacgatct ggcggcgtgt 60

gtcgataagg cgcttgccct catcaccgag gcggcggcac gcggcgcgaa catcgtcacg 120

ctcggcgaga cgtggctgcc gggctatccc gcgtggctgg atgtgtgcgt cgagatgggg 180

ctgtgggatc acgcgccgac caaagccgtc tttcagcggc tccatgccaa cagcgtcacc 240

atccccggcg cggagatcag ccagttctgc gacatcgccc gccgccttag catcgtgctg 300

gtgctcagcg tcaacgagcg cgtccgcaac accttgttca acaccctgct cacgattgac 360

gagcgcggcg acatccgcaa ccaccaccgc aagctgatgc cgacctacac tgagcgcatc 420

gtctgggggc agggcgacgg cgcgggctta caggcggtcg agacggcaac cgggcgcgtc 480

ggcgggctga tctgctggga acactggatg ccgctggcac ggcaggcgct gcacaacgcc 540

ggggagcaaa ttcacgtttc ggtcttcccg accgtcaacg acccgcgcca ccaagtcgcc 600

agccgccagt acgctttcga ggggcgctgc ttcgtgctga ccgccggcag catccagcgc 660

gccgacgacc taccgccgga actgaccgtc aaggcgggca tcgcgccgga tgatctggtg 720

cagggcggcg gcagcgccat catcgcgccg gacatgcgct acctcgccgg accctgcttc 780

gacgaggaaa ccatcctcta cgccgacctc gacctgagcg agacgatccg cgagagcatg 840

acgctggacg tgagcgggca ttactcgcgc cccgacgtgt tcaccttcga ggttaatcgg 900

cagcggaaaa tttag 915



<210> 180 
<211> 304 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 180 



Met Thr Thr Lys Ala Ala Ile Ile Gln Ala Arg Pro Ile Tyr Tyr Asp
1 5 10 15

Leu Ala Ala Cys Val Asp Lys Ala Leu Ala Leu Ile Thr Glu Ala Ala
20 25 30

Ala Arg Gly Ala Asn Ile Val Thr Leu Gly Glu Thr Trp Leu Pro Gly
35 40 45

Tyr Pro Ala Trp Leu Asp Val Cys Val Glu Met Gly Leu Trp Asp His
50 55 60

Ala Pro Thr Lys Ala Val Phe Gln Arg Leu His Ala Asn Ser Val Thr
65 70 75 80

Ile Pro Gly Ala Glu Ile Ser Gln Phe Cys Asp Ile Ala Arg Arg Leu
85 90 95

Ser Ile Val Leu Val Leu Ser Val Asn Glu Arg Val Arg Asn Thr Leu
100 105 110

Phe Asn Thr Leu Leu Thr Ile Asp Glu Arg Gly Asp Ile Arg Asn His
115 120 125

His Arg Lys Leu Met Pro Thr Tyr Thr Glu Arg Ile Val Trp Gly Gln
130 135 140

Gly Asp Gly Ala Gly Leu Gln Ala Val Glu Thr Ala Thr Gly Arg Val
145 150 155 160

Gly Gly Leu Ile Cys Trp Glu His Trp Met Pro Leu Ala Arg Gln Ala
165 170 175

Leu His Asn Ala Gly Glu Gln Ile His Val Ser Val Phe Pro Thr Val
180 185 190

Asn Asp Pro Arg His Gln Val Ala Ser Arg Gln Tyr Ala Phe Glu Gly
195 200 205

Arg Cys Phe Val Leu Thr Ala Gly Ser Ile Gln Arg Ala Asp Asp Leu
210 215 220

Pro Pro Glu Leu Thr Val Lys Ala Gly Ile Ala Pro Asp Asp Leu Val
225 230 235 240

Gln Gly Gly Gly Ser Ala Ile Ile Ala Pro Asp Met Arg Tyr Leu Ala
245 250 255

Gly Pro Cys Phe Asp Glu Glu Thr Ile Leu Tyr Ala Asp Leu Asp Leu
260 265 270

Ser Glu Thr Ile Arg Glu Ser Met Thr Leu Asp Val Ser Gly His Tyr
275 280 285

Ser Arg Pro Asp Val Phe Thr Phe Glu Val Asn Arg Gln Arg Lys Ile
290 295 300

<210> 181 
<211> 990 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 181

atgcccaaga ccgtacgcgc agccgcagtc cagatcgcgc ccgacctgac gtcacgcgct 60

ggcaccgtcg agcgggtcct gaatgcgatt gccgaagcct ctgacaaagg cgcggagctg 120

attgtatttc cagaaacctt tgtgccttgg tatccgtatt tcagcttcgt tctgccgcct 180

gtccagcagg gacccgagca cctgcggctt tatgaagaag cggtgacggt accatcagca 240

gaaacacggg ccgtcgccga cgccgcgcgc aaacgcaatg cggtcatcgt ccttggcgtc 300

aatgagcgcg accacggctc gctctacaac acccagctga tcttcgacgc ggatggcagc 360

ctgaaactca agcgccgcaa gatcacgccc acctatcacg agcggatgat ctggggacag 420

ggcgatggcg ccggtctaaa agtggtcgaa actgccatcg gccgcatggg cgcattggcg 480

tgctgggagc actacaaccc cctcgcccga tacgcgctga tggctcagca tgaggaaatt 540

cacgcctctc attttccggg ctcactggtc ggcccgatat tcggcgagca gatcgaagtc 600

acgatgcgcc accacgcgtt ggaatcgggc tgtttcgtgg tcaatgccac cggctggtta 660

agcgaggagc agatcgcgtc cattcaccca gatcccagcc tgcagaaggg tcttcgagat 720

ggctgcatga cctgcatcat aaccccggaa ggccgccacg tcgttcctcc tctgacatcg 780

ggtgaaggaa tcctgattgg cgacctggac atgcggctca tcaccaagcg caagcgaatg 840

atggattccg tcggacacta tgcacgtcct gagctgctgc accttgtcca tgacacgacg 900

cccgcacgcg cacgcgagca ggtgggcctt tcaggcgatt tttccgatgc agggcaagac 960

aagctatttg aggaggttca agatgcgtga 990

<210> 182
<211> 329
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 182

Met Pro Lys Thr Val Arg Ala Ala Ala Val Gln Ile Ala Pro Asp Leu
1 5 10 15

Thr Ser Arg Ala Gly Thr Val Glu Arg Val Leu Asn Ala Ile Ala Glu
20 25 30

Ala Ser Asp Lys Gly Ala Glu Leu Ile Val Phe Pro Glu Thr Phe Val
35 40 45

Pro Trp Tyr Pro Tyr Phe Ser Phe Val Leu Pro Pro Val Gln Gln Gly
50 55 60

Pro Glu His Leu Arg Leu Tyr Glu Glu Ala Val Thr Val Pro Ser Ala
65 70 75 80

Glu Thr Arg Ala Val Ala Asp Ala Ala Arg Lys Arg Asn Ala Val Ile
85 90 95

Val Leu Gly Val Asn Glu Arg Asp His Gly Ser Leu Tyr Asn Thr Gln
100 105 110

Leu Ile Phe Asp Ala Asp Gly Ser Leu Lys Leu Lys Arg Arg Lys Ile
115 120 125

Thr Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp Gly Ala
130 135 140

Gly Leu Lys Val Val Glu Thr Ala Ile Gly Arg Met Gly Ala Leu Ala
145 150 155 160

Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Gln
165 170 175

His Glu Glu Ile His Ala Ser His Phe Pro Gly Ser Leu Val Gly Pro
180 185 190

Ile Phe Gly Glu Gln Ile Glu Val Thr Met Arg His His Ala Leu Glu
195 200 205

Ser Gly Cys Phe Val Val Asn Ala Thr Gly Trp Leu Ser Glu Glu Gln
210 215 220

Ile Ala Ser Ile His Pro Asp Pro Ser Leu Gln Lys Gly Leu Arg Asp
225 230 235 240

Gly Cys Met Thr Cys Ile Ile Thr Pro Glu Gly Arg His Val Val Pro
245 250 255

Pro Leu Thr Ser Gly Glu Gly Ile Leu Ile Gly Asp Leu Asp Met Arg
260 265 270

Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr Ala
275 280 285

Arg Pro Glu Leu Leu His Leu Val His Asp Thr Thr Pro Ala Arg Ala
290 295 300

Arg Glu Gln Val Gly Leu Ser Gly Asp Phe Ser Asp Ala Gly Gln Asp
305 310 315 320

Lys Leu Phe Glu Glu Val Gln Asp Ala
325

















<210> 183 
<211> 1002 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 183 

atggctgagt cacgcattat tcgtgccgcc gccgcccaga ttgcgccaga tctccatgag 60

gccagcaaaa cgctggcccg ggtgctggac gcgatcgatc aggcagccgc acagggggca 120

gagatcatcg tctttcccga gacctttgtg ccttattacc cctacttctc gtttatcacg 180

cccgcgatga ccgccggagc ggcccatctg aaattgtatg accaggcggt ggtggtgccc 240

ggcccgatca cccatgcggt gggcgaacgc gcccgcctgc gcaacatcgt cgtggtgctg 300

ggggtgaatg aacgtgacca cggcacgctc tacaacaccc aactggtatt tgatgccagc 360

ggggaactgg tgctgaaacg ccgcaaaatc accccgacct atcacgaacg gatgatctgg 420

ggacagggag acggtgccgg attaaaggtg gtggactcgg cggttgggcg catcggggct 480

ttagcctgct gggagcacta caacccactg gcgcgctaca gcctgatgac tcagcacgag 540

gagatccatt gcagccagtt ccctggttca ctggtggggc cgatttttgc cgagcagatg 600

gacgtcacca ttcgccatca tgcactggag tccggttgct ttgtcatcaa tgccaccggc 660

tggctgaccg aggagcagat caacgagctg accagcgacc cggcgttaca aaaggggctg 720

cgtggtggct gcaacaccgc catcatctcg ccggaaggcc gccatctggt gccgccactg 780

accgaaggtg aggggatttt gattgccgat ctggacatgg ccctgatcac caaacgcaaa 840

cgcatgatgg attctgtcgg ccactatgcc cgaccggaat tactcagcct gcgcctcgat 900

gcgacgcctg cccgttatgt ggtggcgcgt gataatgagt ccgaaaccgg aggaggcaac 960

gatgcagaac gtaccgtcta cgcgccagca gctgatcact ga 1002

<210> 184
<211> 333
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 184

Met Ala Glu Ser Arg Ile Ile Arg Ala Ala Ala Ala Gln Ile Ala Pro
1 5 10 15

Asp Leu His Glu Ala Ser Lys Thr Leu Ala Arg Val Leu Asp Ala Ile
20 25 30

Asp Gln Ala Ala Ala Gln Gly Ala Glu Ile Ile Val Phe Pro Glu Thr
35 40 45

Phe Val Pro Tyr Tyr Pro Tyr Phe Ser Phe Ile Thr Pro Ala Met Thr
50 55 60

Ala Gly Ala Ala His Leu Lys Leu Tyr Asp Gln Ala Val Val Val Pro
65 70 75 80

Gly Pro Ile Thr His Ala Val Gly Glu Arg Ala Arg Leu Arg Asn Ile
85 90 95

Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Thr Leu Tyr Asn
100 105 110

Thr Gln Leu Val Phe Asp Ala Ser Gly Glu Leu Val Leu Lys Arg Arg
115 120 125

Lys Ile Thr Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp
130 135 140

Gly Ala Gly Leu Lys Val Val Asp Ser Ala Val Gly Arg Ile Gly Ala
145 150 155 160

Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ser Leu Met
165 170 175

Thr Gln His Glu Glu Ile His Cys Ser Gln Phe Pro Gly Ser Leu Val
180 185 190

Gly Pro Ile Phe Ala Glu Gln Met Asp Val Thr Ile Arg His His Ala
195 200 205

Leu Glu Ser Gly Cys Phe Val Ile Asn Ala Thr Gly Trp Leu Thr Glu
210 215 220

Glu Gln Ile Asn Glu Leu Thr Ser Asp Pro Ala Leu Gln Lys Gly Leu
225 230 235 240

Arg Gly Gly Cys Asn Thr Ala Ile Ile Ser Pro Glu Gly Arg His Leu
245 250 255

Val Pro Pro Leu Thr Glu Gly Glu Gly Ile Leu Ile Ala Asp Leu Asp
260 265 270

Met Ala Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285

Tyr Ala Arg Pro Glu Leu Leu Ser Leu Arg Leu Asp Ala Thr Pro Ala
290 295 300

Arg Tyr Val Val Ala Arg Asp Asn Glu Ser Glu Thr Gly Gly Gly Asn
305 310 315 320

Asp Ala Glu Arg Thr Val Tyr Ala Pro Ala Ala Asp His
325 330

<210> 185 
<211> 1017 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 185 

atgggcattg aacatccgaa atacaaggtc gcggtggtgc aggcggcccc cgcctggctc 60

gatctcgacg gctcggtcga taagtcgatc gcgctgatca aggaggcggc cgagaagggg 120

gcgaagctga tcgcctttcc cgaggccttc atccccggtt acccctggca tatctggatg 180

gactcgccgg cctgggcgat cggccgcggc ttcgtgcagc gttatttcga caattcgctg 240

tcctatgaca gtccgcaggc cgagcggctg cgcgatgcgg tgaagaaggc gaagctcacc 300

gccgtgttcg gactgtccga gcgcgacggc ggcagcctct acctcgcgca atggctgatc 360

gggcccgatg gcgagaccat cgccaagcgc cgcaagctgc ggccgaccca cgccgaacgt 420

accgtctatg gcgaaggcga cggcagcgat cttgccgtgc atgcgcgcgc cgacatcggc 480

cggatcggcg cgctctgctg ctgggagcat ctgcagccac tgtcgaaata cgcgatgtac 540

gcccagaacg aacaggtcca tgtcgcagcc tggcccagct tctcgctgta cgaccccttc 600

gcgccggcgt taggggccga ggtcaacaac gcggcctccc gcgtctatgc ggtggaaggc 660

tcctgcttcg tgctcgcgcc gtgcgcgacg gtgtcgcagg cgatgatcga cgagctctgc 720

gaccggcccg acaagaacgc gctgctgcac gtcggcggcg gctttgccgc gatctatggc 780

cccgacggca gccagatcgg cgacaagctg gcgccggacc aggaggggct gctgatcgcc 840

gagatcgacc ttggcgccat cggtgtcgcc aagaacgccg ccgatcccgc cgggcactat 900

tcgcgtcccg acgtgacgcg gttgctgctc aacaagaagc gataccagcg cgtcgagcag 960

ttcgcgctgc cggtcgacac cgtcgagccg gcggatatcg gcgcagcggc gagctga 1017



<210> 186 
<211> 338 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample



<400> 186 

Met Gly Ile Glu His Pro Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15

Pro Ala Trp Leu Asp Leu Asp Gly Ser Val Asp Lys Ser Ile Ala Leu
20 25 30

Ile Lys Glu Ala Ala Glu Lys Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45

Ala Phe Ile Pro Gly Tyr Pro Trp His Ile Trp Met Asp Ser Pro Ala
50 55 60

Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80

Ser Tyr Asp Ser Pro Gln Ala Glu Arg Leu Arg Asp Ala Val Lys Lys
85 90 95

Ala Lys Leu Thr Ala Val Phe Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110

Leu Tyr Leu Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125

Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly
130 135 140

Glu Gly Asp Gly Ser Asp Leu Ala Val His Ala Arg Ala Asp Ile Gly
145 150 155 160

Arg Ile Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175

Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
180 185 190

Ser Phe Ser Leu Tyr Asp Pro Phe Ala Pro Ala Leu Gly Ala Glu Val
195 200 205

Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220

Leu Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225 230 235 240

Asp Arg Pro Asp Lys Asn Ala Leu Leu His Val Gly Gly Gly Phe Ala
245 250 255

Ala Ile Tyr Gly Pro Asp Gly Ser Gln Ile Gly Asp Lys Leu Ala Pro
260 265 270

Asp Gln Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Ala Ile Gly
275 280 285

Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300

Val Thr Arg Leu Leu Leu Asn Lys Lys Arg Tyr Gln Arg Val Glu Gln
305 310 315 320

Phe Ala Leu Pro Val Asp Thr Val Glu Pro Ala Asp Ile Gly Ala Ala
325 330 335

Ala Ser































<210> 187 
<211> 1059 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 187 

atgggcatca atcatcccaa gtacaaagtg gccgtcgtgc aagcggcacc tgtctggctc 60

gacctggatg gaacagtcga caagtgcatt cggctgatag gcgaggccgc tgagaaggga 120

tgcaagctca ttgcatttcc cgagacgttc atcccggggt acccctggca catctggatg 180

ggagctccgg cctggacgat cgggcgcgga ttcgtgcagc gatacttcga caattcgctt 240

gcgtacgaca gtccgcaggc aaacaagctt cgcgccgcgg tgaagcgcgc cggagtgacg 300

gcagttctcg gcttgtcgga gcgccgcgga ggctccctgt acatcgccca gtggctcatc 360

ggacctgatg gcgagaccat cgctcaacgg cgaaagctgc gccccaccca tgcggagcgc 420

accgtcttcg gcgagggcga tggcagcgat ttggcggtgc acagccgccc cgacatcggc 480

cgactgggtg ccctttgctg ctgggaacat ctccagcctt tgaccaagta cgcgatgtac 540

gcgcaagacg agcaagtgca cgtcgctgca tggccgagct tctcgatgta cgagcctttc 600

gcgcacgccc tggggtggga gacgaacaac gcggtgagca aggtgtacgc ggtcgaaggt 660

tcgtgctacg tcctggcccc ctgcgccatc atctctcagg cgatggtgga cgaactcgtc 720

gacagcgagg acaagaagcc gctggttcat gccggcgggg ggcatgcggt gatctatggt 780

cccgatggca ccctgcttac tcccaagctt gcagaagacg aggagggcct actgatcgcg 840

gagatcgatc tgggggcaat cggggtcgcc aagaacgcgg cagaccccgc cggccactac 900

tcgcggcccg atgtcacccg cctgctcttc aacaaccggc cggccaagcg cgtggagacg 960

atgctgctcc cggtcgacgc ggcagaagtc gtggagccgg cggacggagc gctcaatgcg 1020

tccgagggac gccagcgaca gttcaagctg cccgcctag 1059



<210> 188 
<211> 352 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 188

Met Gly Ile Asn His Pro Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15

Pro Val Trp Leu Asp Leu Asp Gly Thr Val Asp Lys Cys Ile Arg Leu
20 25 30

Ile Gly Glu Ala Ala Glu Lys Gly Cys Lys Leu Ile Ala Phe Pro Glu
35 40 45

Thr Phe Ile Pro Gly Tyr Pro Trp His Ile Trp Met Gly Ala Pro Ala
50 55 60

Trp Thr Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80

Ala Tyr Asp Ser Pro Gln Ala Asn Lys Leu Arg Ala Ala Val Lys Arg
85 90 95

Ala Gly Val Thr Ala Val Leu Gly Leu Ser Glu Arg Arg Gly Gly Ser
100 105 110

Leu Tyr Ile Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125

Gln Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Phe Gly
130 135 140

Glu Gly Asp Gly Ser Asp Leu Ala Val His Ser Arg Pro Asp Ile Gly
145 150 155 160

Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Thr Lys
165 170 175

Tyr Ala Met Tyr Ala Gln Asp Glu Gln Val His Val Ala Ala Trp Pro
180 185 190

Ser Phe Ser Met Tyr Glu Pro Phe Ala His Ala Leu Gly Trp Glu Thr
195 200 205

Asn Asn Ala Val Ser Lys Val Tyr Ala Val Glu Gly Ser Cys Tyr Val
210 215 220

Leu Ala Pro Cys Ala Ile Ile Ser Gln Ala Met Val Asp Glu Leu Val
225 230 235 240

Asp Ser Glu Asp Lys Lys Pro Leu Val His Ala Gly Gly Gly His Ala
245 250 255

Val Ile Tyr Gly Pro Asp Gly Thr Leu Leu Thr Pro Lys Leu Ala Glu
260 265 270

Asp Glu Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Ala Ile Gly
275 280 285

Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300

Val Thr Arg Leu Leu Phe Asn Asn Arg Pro Ala Lys Arg Val Glu Thr
305 310 315 320

Met Leu Leu Pro Val Asp Ala Ala Glu Val Val Glu Pro Ala Asp Gly
325 330 335

Ala Leu Asn Ala Ser Glu Gly Arg Gln Arg Gln Phe Lys Leu Pro Ala
340 345 350







<210> 189 
<211> 1005 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 189 

atgcaagaca cgaaattcaa agttgcagtc gtccaggccg cgccggtatt catggatgcg
ccagcctccg tggccaaggc gatcggtttc atccaggagg cgggcgcagc cggggcgaag
ctgctggcgt tcccggaggt ctggattccg ggctaccctt ggtggctttg gctcgggacg
ccggcgtggg gaatgcagtt tgtgccgcgc tatcacgcca attcgctgcg tgctgatgga
cccgaaatcc tcgctctttg tgcggccgcc gccgaagcga agatcaacgt cgtgatgggc
ttctccgaaa tcgacggagg aacgctctac ctaagtcagg ttttcatcag cgatgcgggc
aagatcatct tcaagcgccg aaagctcaag ccgacccacg tcgaacgtac gctatttggt
gaaggagatg ggtctgattt ccgagtcgtc gacagcagcg tcgggcgcct cggagccctg
tgctgtgccg aacacattca gccgttgtcg aaatacgcca tgtacgcgat gaacgagcaa
attcatgtgg cgtcgtggcc atctttcacg ctctatcgcg gcaaagccta cgctttgggt 
catgaggtga atcttgccgc cagccaaatc tacgcgctcg aaggaggttg cttcgtcttg 
catgccacgg caattaccgg tcaggatatg ttcgacatgc tttgcgacac tccggaaagg 
gcggatttgc tgaatgcgga gggagcaaag ccgggtggag gctattcgat gatttttggt 
cccgatggtc agccgatgtg cgagcatctg ccgcaggaca aggaaggcat cctctatgcc 
ggcgtagacc tgtcgatgat tgcgatcgcc aaagcggcct acgatcctac ggggcactac 
gcccgcggtg atgtcgtccg tctcatggtc aaccgcagcc cccgtcgcac gagcgtcagc 
ttcagcgaag acgagaacgc ggcggtcact ttcaccgaga cctga 

<210> 190 
<211> 334 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 190

Met Gln Asp Thr Lys Phe Lys Val Ala Val Val Gln Ala Ala Pro Val

1 5 10 15

Phe Met Asp Ala Pro Ala Ser Val Ala Lys Ala Ile Gly Phe Ile Gln

20 25 30

Glu Ala Gly Ala Ala Gly Ala Lys Leu Leu Ala Phe Pro Glu Val Trp

35 40 45

Ile Pro Gly Tyr Pro Trp Trp Leu Trp Leu Gly Thr Pro Ala Trp Gly

50 55 60

Met Gln Phe Val Pro Arg Tyr His Ala Asn Ser Leu Arg Ala Asp Gly

65 70 75 80
Pro Glu Ile Leu Ala Leu Cys Ala Ala Ala Ala Glu Ala Lys Ile Asn

85 90 95

Val Val Met Gly Phe Ser Glu Ile Asp Gly Gly Thr Leu Tyr Leu Ser

100 105 110

Gln Val Phe Ile Ser Asp Ala Gly Lys Ile Ile Phe Lys Arg Arg Lys

115 120 125

Leu Lys Pro Thr His Val Glu Arg Thr Leu Phe Gly Glu Gly Asp Gly

130 135 140

Ser Asp Phe Arg Val Val Asp Ser Ser Val Gly Arg Leu Gly Ala Leu
145 150 155 160

Cys Cys Ala Glu His Ile Gln Pro Leu Ser Lys Tyr Ala Met Tyr Ala

165 170 175

Met Asn Glu Gln Ile His Val Ala Ser Trp Pro Ser Phe Thr Leu Tyr

180 185 190

Arg Gly Lys Ala Tyr Ala Leu Gly His Glu Val Asn Leu Ala Ala Ser

195 200 205

Gln Ile Tyr Ala Leu Glu Gly Gly Cys Phe Val Leu His Ala Thr Ala

210 215 220

Ile Thr Gly Gln Asp Met Phe Asp Met Leu Cys Asp Thr Pro Glu Arg

225 230 235 240

Ala Asp Leu Leu Asn Ala Glu Gly Ala Lys Pro Gly Gly Gly Tyr Ser

245 250 255

Met Ile Phe Gly Pro Asp Gly Gln Pro Met Cys Glu His Leu Pro Gln

260 265 270

Asp Lys Glu Gly Ile Leu Tyr Ala Gly Val Asp Leu Ser Met Ile Ala

275 280 285

Ile Ala Lys Ala Ala Tyr Asp Pro Thr Gly His Tyr Ala Arg Gly Asp

290 295 300

Val Val Arg Leu Met Val Asn Arg Ser Pro Arg Arg Thr Ser Val Ser
305 310 315 320

Phe Ser Glu Asp Glu Asn Ala Ala Val Thr Phe Thr Glu Thr
325 330

<210> 191
<211> 945 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 191 

atgaaaaagt tagctgtggt tcaacgtgcg tcaggatttt tagataagca gcagagcatc 60
gcgttggcgg tggaaagtat tcagtctgct gcgaataatg gcgcagagct tgttgttttt 120
acggaagcct ttattcctgg ttatcctgtc tggttatggc gtctgcgccc tggcaaagac 180
tgggggacaa cagacagtct ttatcaacgc ttaataagca acgcggttga tttaagctca 240
tcggatttgg atccgattta tgaagcggca aaacgtcatc acgtcacggt tgtatgcggc 300
attaatgaac gcgactccag cgtcagccga acaacgctat acaacactta catcacggtt 360
tgtcatgagg gcaatctcat caatgttcat cgaaaactga tgccgaccaa cccagagaga 420
atggtgtggg gctttggtga tgcgactgga ttaagggtag tagacactcc tgtcggaagg 480
attggctcac tcgtttgctg ggagaactac atgccgttgg cacgctatgc actttatgct 540
cagggcgtcg aaatttacat tgcgcctact tatgacagtg gctcggactg gactgaaagc 600
ttgcgccata tcgccagaga gggcagatgc tacgttgtcg gcagcggtaa cttgttgaga 660
gccagcgacc tgcctgatga ttttccagaa aaagaaaccc tctatcctga taaagacgag 720
tggattaacg gcggagactc taccgttatc gctcccggcg gtgaaacatt agttgctccg 780
ctgcatgcag aggaaggcat actgtattgc gatattgata ctgataaagt ggcggcggct 840
cggcgttctt tcgacgttgc aggccattac tctcgcccag acatatttac actcaacgta 900
aatcgagcgc cgcaaacatc tctgcgtatc agggaagccg agtaa 945



<210> 192 
<211> 314 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 192

Met Lys Lys Leu Ala Val Val Gln Arg Ala Ser Gly Phe Leu Asp Lys
1 5 10 15

Gln Gln Ser Ile Ala Leu Ala Val Glu Ser Ile Gln Ser Ala Ala Asn
20 25 30

Asn Gly Ala Glu Leu Val Val Phe Thr Glu Ala Phe Ile Pro Gly Tyr
35 40 45

Pro Val Trp Leu Trp Arg Leu Arg Pro Gly Lys Asp Trp Gly Thr Thr
50 55 60

Asp Ser Leu Tyr Gln Arg Leu Ile Ser Asn Ala Val Asp Leu Ser Ser
65 70 75 80

Ser Asp Leu Asp Pro Ile Tyr Glu Ala Ala Lys Arg His His Val Thr
85 90 95

Val Val Cys Gly Ile Asn Glu Arg Asp Ser Ser Val Ser Arg Thr Thr
100 105 110

Leu Tyr Asn Thr Tyr Ile Thr Val Cys His Glu Gly Asn Leu Ile Asn
115 120 125

Val His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val Trp Gly
130 135 140

Phe Gly Asp Ala Thr Gly Leu Arg Val Val Asp Thr Pro Val Gly Arg
145 150 155 160

Ile Gly Ser Leu Val Cys Trp Glu Asn Tyr Met Pro Leu Ala Arg Tyr
165 170 175

Ala Leu Tyr Ala Gln Gly Val Glu Ile Tyr Ile Ala Pro Thr Tyr Asp
180 185 190

Ser Gly Ser Asp Trp Thr Glu Ser Leu Arg His Ile Ala Arg Glu Gly
195 200 205

Arg Cys Tyr Val Val Gly Ser Gly Asn Leu Leu Arg Ala Ser Asp Leu
210 215 220

Pro Asp Asp Phe Pro Glu Lys Glu Thr Leu Tyr Pro Asp Lys Asp Glu
225 230 235 240

Trp Ile Asn Gly Gly Asp Ser Thr Val Ile Ala Pro Gly Gly Glu Thr
245 250 255

Leu Val Ala Pro Leu His Ala Glu Glu Gly Ile Leu Tyr Cys Asp Ile
260 265 270

Asp Thr Asp Lys Val Ala Ala Ala Arg Arg Ser Phe Asp Val Ala Gly
275 280 285

His Tyr Ser Arg Pro Asp Ile Phe Thr Leu Asn Val Asn Arg Ala Pro
290 295 300

Gln Thr Ser Leu Arg Ile Arg Glu Ala Glu
305 310

<210> 193
<211> 966
<212> DNA
<213> Unknown



<220> 

<223> Obtained from an environmental sample 



<400> 193 

atgtcaaacg agaaccacaa ccaaacattc aaagttgccg cggtgcaggc cacacctgta 60
ttcctcgatc gtgaagcgac catcgacaaa gcttgtgagt tgattgctgc agccggcaat 120
gaaggagcgc ggctggttgt cttcccggag gcattcatcc catcctaccc agattgggta 180
tgggcaatcc caccgggcga agaaggcgtg ctcaatgagt tgtacgcgga actgctctcc 240
aattcggtca cgattcccag tgatgtgacg gatagactgt gccgagccgc gagacttgcc 300
aatgcctacg tagtgatggg gatgagcgaa cgcaatgccg aggccagtgg cgcaagcctg 360
tataacacgc tgttgtacat cgatgcgcag ggcgagattc tgggcaaaca tcgaaagctg 420
gtgcccacag gcggcgaacg gctggtgtgg gcgcagggcg atggcagcac gctgcaggtc 480
tacgatactc cactgggtaa actcggcggt ttgatttgct gggagaatta tatgccgctg 540
gcccgctaca ccatgtacgc atggggcaca caaatctatg ttgcggcgac atgggatcgc 600
gggcaaccct ggctctccac tttacggcat atcgccaaag aaggcagggt gtacgtgatc 660
ggctgctgta tcgtgatgcg caaagacgat atcccagatc gttacccgat gaagcagaag 720
ttttacgcgg aggccgatga gtggatcaac ataggggaca gcgcaatcgt caatcctgaa 780
gggcagttta gcgccgggcc ggtacgcaaa caggaagaga ttctctacgc ggaaattgat 840
ccgcgcatgg tgcaaggccc gaagtggatg ctcgacgtag cagggcacta cgcgaggccg 900
gacgtattcc agttgacggt gcatacggat gcgaggcaga tgatcaggtt ggaacacgat 960
gtttaa 966



<210> 194 
<211> 321 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 194 

Met Ser Asn Glu Asn His Asn Gln Thr Phe Lys Val Ala Ala Val Gln
1 5 10 15

Ala Thr Pro Val Phe Leu Asp Arg Glu Ala Thr Ile Asp Lys Ala Cys
20 25 30

Glu Leu Ile Ala Ala Ala Gly Asn Glu Gly Ala Arg Leu Val Val Phe
35 40 45

Pro Glu Ala Phe Ile Pro Ser Tyr Pro Asp Trp Val Trp Ala Ile Pro
50 55 60

Pro Gly Glu Glu Gly Val Leu Asn Glu Leu Tyr Ala Glu Leu Leu Ser
65 70 75 80

Asn Ser Val Thr Ile Pro Ser Asp Val Thr Asp Arg Leu Cys Arg Ala
85 90 95

Ala Arg Leu Ala Asn Ala Tyr Val Val Met Gly Met Ser Glu Arg Asn
100 105 110

Ala Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Ile Asp
115 120 125

Ala Gln Gly Glu Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly
130 135 140

Gly Glu Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln Val
145 150 155 160

Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn
165 170 175

Tyr Met Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gln Ile
180 185 190

Tyr Val Ala Ala Thr Trp Asp Arg Gly Gln Pro Trp Leu Ser Thr Leu
195 200 205

Arg His Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile
210 215 220

Val Met Arg Lys Asp Asp Ile Pro Asp Arg Tyr Pro Met Lys Gln Lys
225 230 235 240

Phe Tyr Ala Glu Ala Asp Glu Trp Ile Asn Ile Gly Asp Ser Ala Ile
245 250 255

Val Asn Pro Glu Gly Gln Phe Ser Ala Gly Pro Val Arg Lys Gln Glu
260 265 270

Glu Ile Leu Tyr Ala Glu Ile Asp Pro Arg Met Val Gln Gly Pro Lys
275 280 285

Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gln
290 295 300

Leu Thr Val His Thr Asp Ala Arg Gln Met Ile Arg Leu Glu His Asp
305 310 315 320

Val



<210> 195 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 195 

atgaaagtcg tcaaagccgc cgcggtacag ttcagcccgg tgctctatag ccgtgaggca 60
accgtcgcca aggtcgtgca gaagatccac gaactcggcc agaaaggcgt gcagttcgcc 120
accttccccg aaacggtcgt gccttattac ccttactttg cggccgtcca gacgggcatc 180
gagcttctct cgggcaccga acatctgcgc ctgctcgaac aggctgtgac tgtcccgtcc 240
gctgctaccg acgcgatcgg cgaagccgcg cgaaaggccg gcatggtcgt gtccattggc 300
gtcaatgagc gcgatggcgg cacgctgtac aacgcacaac tgctcttcga tgccgacggt 360
acgctgatcc agcgccgccg caagatcacg ccgacgcatt tcgaacgcat gatctggggc 420
cagggagatg gctcgggctt gcgtgcagtc gacagcgccg tcggccgcgt cggccagctc 480
gcatgtttcg agcacaacaa cccgctcgcc cgctacgcaa tgatcgccga cggcgagcag 540
atccattcgg cgatgtaccc tggctcggcc tttggcgagg gcttcgccca gcgtatggaa 600
atcaacatcc gccagcatgc gctcgagtcc gccgctttcg tcgtcaacgc aacagcgtgg 660
ctggacgccg accagcaggc gcaaatcatg aaggacaccg gttgtggaat cggtccgatc 720
acgggcggct gcttcaccac gatcgtctct cctgacggca tgctgatggc cgagccgctt 780
cgctcgggtg aaggcgaagt gatcgtcgat ctcgacttca cgcagatcga ccgccgcaag 840
atgctgatgg actcggccgg ccactacaac cgccctgaac tgctgagtct gatgatcgac 900
cgtacgccga ccgcgcatgt tcacgaacgc gcttcgcacc cgatgatcgt caacgaccag 960
ggttccgacg atctgcgcac ccaggctgca tga 993

<210> 196
<211> 330 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 196 
Met Lys Val Val Lys Ala Ala Ala Val Gln Phe Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Glu Ala Thr Val Ala Lys Val Val Gln Lys Ile His Glu Leu
20 25 30

Gly Gln Lys Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ala Ala Val Gln Thr Gly Ile Glu Leu Leu Ser
50 55 60

Gly Thr Glu His Leu Arg Leu Leu Glu Gln Ala Val Thr Val Pro Ser
65 70 75 80

Ala Ala Thr Asp Ala Ile Gly Glu Ala Ala Arg Lys Ala Gly Met Val
85 90 95

Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Ala
100 105 110

Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125

Ile Thr Pro Thr His Phe Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140

Ser Gly Leu Arg Ala Val Asp Ser Ala Val Gly Arg Val Gly Gln Leu
145 150 155 160

Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met Ile Ala
165 170 175

Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Ala Phe Gly
180 185 190

Glu Gly Phe Ala Gln Arg Met Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205

Glu Ser Ala Ala Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220

Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Gly Ile Gly Pro Ile
225 230 235 240

Thr Gly Gly Cys Phe Thr Thr Ile Val Ser Pro Asp Gly Met Leu Met
245 250 255

Ala Glu Pro Leu Arg Ser Gly Glu Gly Glu Val Ile Val Asp Leu Asp
260 265 270

Phe Thr Gln Ile Asp Arg Arg Lys Met Leu Met Asp Ser Ala Gly His
275 280 285

Tyr Asn Arg Pro Glu Leu Leu Ser Leu Met Ile Asp Arg Thr Pro Thr
290 295 300

Ala His Val His Glu Arg Ala Ser His Pro Met Ile Val Asn Asp Gln
305 310 315 320

Gly Ser Asp Asp Leu Arg Thr Gln Ala Ala
325 330
320 



<210> 197 
<211> 1017 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 197

atgggcatcg aacatccaaa ataccgcgtt gccgccgtgc aggctgcgcc ggcctggctc 60
gacctcgatc gctcgatcga caaggctatt gcactgatcg aggaggccgc cgcgaacggc 120
gccagattga tcgcattccc ggaggtcttc atccccggct acccctggca tatctggctc 180
gactcgccgg cctgggcgat cggccgcggc ttcgtgcagc gttatttcga caattcgctc 240
gcttatgata gtccgcaggc cgagcggctc cgcgcagcgg tccgcaaggc gcgcctgacc 300
gccgtgatcg gcctttcgga gcggagcggc ggcagcctct acatcgcgca atggctcgtt 360
ggccccgacg gcgagaccat cgcgaagcgc cgcaagctcc gtccgacgca tgccgagcgc 420
acggtctatg gcgagggcga cggcagcgat ctggcggtcc atgaccggcc cgatatcgga 480
cggctcggcg cgctgtgctg ctgggaacat ctgcaaccgt tgtcgaaata tgcgatgtat 540
gcccagaacg agcaggtcca tgtggcgtca tggccgagtt tttcgctcta cgatcccttt 600
gccccggcgc tcggcgcgga ggtgaacaat gcggcctccc gggtctatgc ggtcgaaggc 660
tcctgcttcg tgctggcgcc gtgcgcgacc gtctcgcagg ccatgatcga tgagctgtgc 720
gaccggcccg acaagcacgc gctgctccat gccggcggtg gctttgccgc gatctacggg 780
cccgacggca gttcgctggc cgaaaagctc gcgccggacc aggagggcct gctttacgcc 840
gacatcgatc tcggcgcgat cggcgtcgcg aagaacgccg ccgacccggc agggcattat 900
tcgcggcccg atgtcacgcg gctgctgctg aacaacaagc cctacaagcg cgtggagcat 960
tttgctttgc ccggcgatac cgtggcgcct gccgatgtgg atgcggcggc gagctga 1017



<210> 198 
<211> 338 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample



<400> 198 




























Met Gly Ile Glu His Pro Lys Tyr Arg Val Ala Ala Val Gln Ala Ala
1 5 10 15

Pro Ala Trp Leu Asp Leu Asp Arg Ser Ile Asp Lys Ala Ile Ala Leu
20 25 30

Ile Glu Glu Ala Ala Ala Asn Gly Ala Arg Leu Ile Ala Phe Pro Glu
35 40 45

Val Phe Ile Pro Gly Tyr Pro Trp His Ile Trp Leu Asp Ser Pro Ala
50 55 60

Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80

Ala Tyr Asp Ser Pro Gln Ala Glu Arg Leu Arg Ala Ala Val Arg Lys
85 90 95

Ala Arg Leu Thr Ala Val Ile Gly Leu Ser Glu Arg Ser Gly Gly Ser
100 105 110

Leu Tyr Ile Ala Gln Trp Leu Val Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125

Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly
130 135 140

Glu Gly Asp Gly Ser Asp Leu Ala Val His Asp Arg Pro Asp Ile Gly
145 150 155 160

Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175

Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ser Trp Pro
180 185 190

Ser Phe Ser Leu Tyr Asp Pro Phe Ala Pro Ala Leu Gly Ala Glu Val
195 200 205

Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220

Leu Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225 230 235 240

Asp Arg Pro Asp Lys His Ala Leu Leu His Ala Gly Gly Gly Phe Ala
245 250 255

Ala Ile Tyr Gly Pro Asp Gly Ser Ser Leu Ala Glu Lys Leu Ala Pro
260 265 270

Asp Gln Glu Gly Leu Leu Tyr Ala Asp Ile Asp Leu Gly Ala Ile Gly
275 280 285

Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300

Val Thr Arg Leu Leu Leu Asn Asn Lys Pro Tyr Lys Arg Val Glu His
305 310 315 320

Phe Ala Leu Pro Gly Asp Thr Val Ala Pro Ala Asp Val Asp Ala Ala
325 330 335

Ala Ser



























<210> 199 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 199 

atgtcaaacg cactgaagcc gttcaaaatc gccgccgttc aggcgacgcc ggtttttctg 60
aaccgggaag ccacggcgga gaaggccgcg gctttgatcc gcgaagcggg aagcgccgga 120
gccaagctca tcgttttccc ggaatcgttt attccggcct atccggactg ggtctgggtg 180
gtcccctcgg ggagggatcg ccttctcagc ggcctctacg gggagatgct cgaaaacgcc 240
gtggaaatcc ccggcccggc cacggggcat atcggccggg cggcgaagga atcgggcgct 300
tatgtcgtca tgggcgtgac cgagcgggac acggaggcga gcggagccag tttgttcaac 360
accttgattt atttcggtcc gaccggggaa attttgggca aacaccggaa gctggttccc 420
accgggggcg aacggatcgt ctgggcccag ggggacggaa gcaccctgga ggtctacgat 480
acgcccctgg gaaaactggg cgggctgatc tgctgggaaa actacatgcc cctggcccgg 540
tacgccatgt acgcctgggg aacccagctt tacgtggccg ccacctggga ccgaggcgaa 600
ccctggcttt cgacgcttcg gcatatcgcc aaggaagggc gggtgtatgt catcgggtgc 660
tgcatcgcca tgcggaaagg ggatatcccg gatcggttcg aacacaaggg gctctacgcc 720
cccgaccggg actggatcaa ccccggcgac agcgcgatcg tcaaccccca gggggagatg 780
atcgccgggc ccgcttccaa taaggaagag atcctttatg cggaagtcga cccgcagatg 840
atgcgcgggc ccaaatggat gctcgatgtg gccggccatt acgcgcggcc cgatgtcttc 900
gagctcaccg tccgccggga accgcggccg atgatccgcg tggcgggagg cgcgggcggg 960
accgaaccca aagagaagaa gaccgccggc tga 993



<210> 200 
<211> 330 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample



<400> 200 

Met Ser Asn Ala Leu Lys Pro Phe Lys Ile Ala Ala Val Gln Ala Thr
1 5 10 15

Pro Val Phe Leu Asn Arg Glu Ala Thr Ala Glu Lys Ala Ala Ala Leu
20 25 30

Ile Arg Glu Ala Gly Ser Ala Gly Ala Lys Leu Ile Val Phe Pro Glu
35 40 45

Ser Phe Ile Pro Ala Tyr Pro Asp Trp Val Trp Val Val Pro Ser Gly
50 55 60

Arg Asp Arg Leu Leu Ser Gly Leu Tyr Gly Glu Met Leu Glu Asn Ala
65 70 75 80

Val Glu Ile Pro Gly Pro Ala Thr Gly His Ile Gly Arg Ala Ala Lys
85 90 95

Glu Ser Gly Ala Tyr Val Val Met Gly Val Thr Glu Arg Asp Thr Glu
100 105 110

Ala Ser Gly Ala Ser Leu Phe Asn Thr Leu Ile Tyr Phe Gly Pro Thr
115 120 125

Gly Glu Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu
130 135 140

Arg Ile Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Glu Val Tyr Asp
145 150 155 160

Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met
165 170 175

Pro Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Thr Gln Leu Tyr Val
180 185 190

Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His
195 200 205

Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile Ala Met
210 215 220

Arg Lys Gly Asp Ile Pro Asp Arg Phe Glu His Lys Gly Leu Tyr Ala
225 230 235 240

Pro Asp Arg Asp Trp Ile Asn Pro Gly Asp Ser Ala Ile Val Asn Pro
245 250 255

Gln Gly Glu Met Ile Ala Gly Pro Ala Ser Asn Lys Glu Glu Ile Leu
260 265 270

Tyr Ala Glu Val Asp Pro Gln Met Met Arg Gly Pro Lys Trp Met Leu
275 280 285

Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu Leu Thr Val
290 295 300

Arg Arg Glu Pro Arg Pro Met Ile Arg Val Ala Gly Gly Ala Gly Gly
305 310 315 320

Thr Glu Pro Lys Glu Lys Lys Thr Ala Gly
325 330















<210> 201 
<211> 930 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 201 

atgaccacga agagcatccg catcgcggcc gtacaggcgg ctcccgcgtt tctcgacctg 60
gctggtacgc tggaccggct cgaggcctgg gcccgcaagg ccgccgccac cggtgcccgc 120
gtcatcgcgt tccccgagac ctggctgccg ggctacccgg cgtggatcga ctcgtcgccg 180
gaggccgcga tctggggcca tcccggctcg cgcgacctgc accagcgcct gatggagaat 240
gccgtcgagg tcccgggccc cgcgaccgcg cgcatcgcga agctcgccgg cgagctcggc 300
gtgacgatcg tggtcggcgc gcacgagcgg gcggggaaca ccctctacaa cacggcgctg 360
acgttcgggc ccgagggcag gctgctcaat caccaccgga agctggtgcc gacctacagc 420
gaacggctgc tgtggggcta cggcgacggc gctggactgg tggcgccggc ggtggacggt 480
gtgaaggtcg gggcgctggt gtgctgggag cactggatgc cgctcacccg ccaggcgatg 540
cacgacgtcg gcgagcacgt gcacgtcgcc ctgtggcccg gcgtccacga gatgcaccag 600
gtggcctcgc ggcactatgc gttcgagggc cgctgtttcg tgatcgcggt cgggagcatc 660
ctgcgcgtgg accagatgcc gaagcagctg ccgccgctgg agaagtacgc gaagagcgcc 720
aaggggctga tgatcgcggg cggcagcgcc atcatcgcgc cgaacggccg ctacgtcgcg 780
gcgccggtgt acgacgagga gacgatcgtc accgccgact gcgacctcgg cgagatcccg 840
cgcgaggcgc agacgctcga tgtctcgggc cactacagcc ggccggacgt gttcagcttc 900
ggggtggtca gacaccggcc gcgtgcgtaa 930



<210> 202 
<211> 309 
<212> PRT 
<213> Unknown 



<220> 
<223> Obtained from an environmental sample
<400> 202 

Met Thr Thr Lys Ser Ile Arg Ile Ala Ala Val Gln Ala Ala Pro Ala

1 5 10 15

Phe Leu Asp Leu Ala Gly Thr Leu Asp Arg Leu Glu Ala Trp Ala Arg

20 25 30

Lys Ala Ala Ala Thr Gly Ala Arg Val Ile Ala Phe Pro Glu Thr Trp

35 40 45

Leu Pro Gly Tyr Pro Ala Trp Ile Asp Ser Ser Pro Glu Ala Ala Ile

50 55 60

Trp Gly His Pro Gly Ser Arg Asp Leu His Gln Arg Leu Met Glu Asn
65 70 75 80

Ala Val Glu Val Pro Gly Pro Ala Thr Ala Arg Ile Ala Lys Leu Ala

85 90 95

Gly Glu Leu Gly Val Thr Ile Val Val Gly Ala His Glu Arg Ala Gly

100 105 110

Asn Thr Leu Tyr Asn Thr Ala Leu Thr Phe Gly Pro Glu Gly Arg Leu

115 120 125

Leu Asn His His Arg Lys Leu Val Pro Thr Tyr Ser Glu Arg Leu Leu

130 135 140

Trp Gly Tyr Gly Asp Gly Ala Gly Leu Val Ala Pro Ala Val Asp Gly
145 150 155 160

Val Lys Val Gly Ala Leu Val Cys Trp Glu His Trp Met Pro Leu Thr

165 170 175

Arg Gln Ala Met His Asp Val Gly Glu His Val His Val Ala Leu Trp

180 185 190

Pro Gly Val His Glu Met His Gln Val Ala Ser Arg His Tyr Ala Phe

195 200 205

Glu Gly Arg Cys Phe Val Ile Ala Val Gly Ser Ile Leu Arg Val Asp

210 215 220

Gln Met Pro Lys Gln Leu Pro Pro Leu Glu Lys Tyr Ala Lys Ser Ala
225 230 235 240

Lys Gly Leu Met Ile Ala Gly Gly Ser Ala Ile Ile Ala Pro Asn Gly

245 250 255

Arg Tyr Val Ala Ala Pro Val Tyr Asp Glu Glu Thr Ile Val Thr Ala

260 265 270

Asp Cys Asp Leu Gly Glu Ile Pro Arg Glu Ala Gln Thr Leu Asp Val

275 280 285 

Ser Gly His Tyr Ser Arg Pro Asp Val Phe Ser Phe Gly Val Val Arg 

290 295 300 

His Arg Pro Arg Ala 
305 

<210> 203 
<211> 966 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 203 

atgtcaagct tacccacatc cgcgttcacc gtcgccgccg cgcaagcgtc gccagtgttc 60
ctcgaccgcg acgcgacgct gcagaaggct tgcgggctga tcgccgacgc cgggcgcgcg 120
ggcgcgcgcc tgatcgtctt ccccgaagcc ttcatcccgg cctaccccga ttgggtgtgg 180
gcggtcccag ctggcgaaga ggggatgctg agcgagctct acgccgagct ggtcgcgaat 240
tcgctggcta ttccgagcga cgcgaccgat cggctatgtc gcgcggcgca ggccgcgcat 300
atcaatgtgg tcgtggggtt gagcgagcgc aatgtcgagg ccagcggcgc cagcctctac 360
aacacgctgc tgtatatcga cgcggcggga acgatcctgg gtaaacaccg caagcttgtg 420
ccgaccggcg gggagcgcct ggtctgggcg cagggcgacg gcagcacgct cgatgtgtac 480
gacaccgcgc tcggcaagct cggcggcctg atctgttggg aaaactacat gccgctggca 540
cgctacgcgc tgtacgcctg gggtgtgcaa atctatgtcg cggccacctg ggatcgcggc 600
gagccctggc tttctactct gcgacatatc gccaaggaag gccgtgtcta cgtgatcggc 660
tgtggcatgg cgctgcgcag agatgatatt cccgatcgct tcgctttcaa gcagcgcttc 720
tatgcccagg ccggcgaatg gatcaacgtc ggcgacagcg cgatcgtcaa cccgagcggc 780
gagtttattg ccggacctgt gcgcgaacgc gaggagattc tgtacgcgga ggtcgacccg 840
gagcagatga gcgggccaaa gtggatgctc gacgtggccg ggcactacgg gcggccggat 900
gtcttccggc tcagcgtcaa ccgggcgccg caccagatga tccagacgga gaaccgggag 960
acctga 966



<210> 204 
<211> 321 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 204 

Met Ser Ser Leu Pro Thr Ser Ala Phe Thr Val Ala Ala Ala Gln Ala
1 5 10 15

Ser Pro Val Phe Leu Asp Arg Asp Ala Thr Leu Gln Lys Ala Cys Gly
20 25 30

Leu Ile Ala Asp Ala Gly Arg Ala Gly Ala Arg Leu Ile Val Phe Pro
35 40 45

Glu Ala Phe Ile Pro Ala Tyr Pro Asp Trp Val Trp Ala Val Pro Ala
50 55 60

Gly Glu Glu Gly Met Leu Ser Glu Leu Tyr Ala Glu Leu Val Ala Asn
65 70 75 80

Ser Leu Ala Ile Pro Ser Asp Ala Thr Asp Arg Leu Cys Arg Ala Ala
85 90 95

Gln Ala Ala His Ile Asn Val Val Val Gly Leu Ser Glu Arg Asn Val
100 105 110

Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Ile Asp Ala
115 120 125

Ala Gly Thr Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly
130 135 140

Glu Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Asp Val Tyr
145 150 155 160

Asp Thr Ala Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn Tyr
165 170 175

Met Pro Leu Ala Arg Tyr Ala Leu Tyr Ala Trp Gly Val Gln Ile Tyr
180 185 190

Val Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg
195 200 205

His Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Gly Met Ala
210 215 220

Leu Arg Arg Asp Asp Ile Pro Asp Arg Phe Ala Phe Lys Gln Arg Phe
225 230 235 240

Tyr Ala Gln Ala Gly Glu Trp Ile Asn Val Gly Asp Ser Ala Ile Val
245 250 255

Asn Pro Ser Gly Glu Phe Ile Ala Gly Pro Val Arg Glu Arg Glu Glu
260 265 270

Ile Leu Tyr Ala Glu Val Asp Pro Glu Gln Met Ser Gly Pro Lys Trp
275 280 285

Met Leu Asp Val Ala Gly His Tyr Gly Arg Pro Asp Val Phe Arg Leu
290 295 300

Ser Val Asn Arg Ala Pro His Gln Met Ile Gln Thr Glu Asn Arg Glu
305 310 315 320

Thr

<210> 205
<211> 969 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 205 

atgaccaccg taaaagccgc cgcagtacag atcagccccg tgctctatag ccgggaggcc 60

accgtagaca aggtcgttcg caagatccgc gagctcggcc aaaagggagt gcagttcgcc 120

accttcccgg aaaccgtagt gccgtactac ccctacttcg ctgcagtcca gacaggcatc 180

gaactgttgt ccggcaagga acacatgcgc ctgctggagc aggccgttac cgtcccctcg 240

cccgccacgg atgcgattgc tcaggcggcg cgcgaagcca atatggtggt gtccatcggc 300

gtcaacgagc gcgacggcgg caccatctac aacacgcagc tgctcttcga tgccgacggc 360

acgctcgtgc agcgccgccg caagataacg ccaacgcact tcgagcgcat ggtctggggc 420

cagggcgatg gttcgggatt gcgcgccgcg gacaccaagg ttggccgcat cggccagttg 480

gcctgcttcg agcacaacaa cccgctcgcc cgttacgcca tgatggccga tggcgagcag 540

atccactccg ccatgtaccc gggctcggcc ttcggcgagg gcttcgcgca gcgcatggag 600

atcaacatcc gccagcatgc cctggagtct ggctgcttcg tggtgaatgc gaccgcctgg 660

ctcgatgccg accaacaggc gcagatcatg aaggacaccg gttgctcgat cggcccgatc 720

tccggcggct gcttcacgac catcgtcacg cctgagggca tgctgattgg cgagccgctc 780

cgcgagggcg aaggcgaaat catcgccgac ctcgatttct cgatgatcga tcgccgcaag 840

ctgctgatgg actcggtcgg tcactacaac cgtccggagc ttctgagcct cctgatcgat 900

cgcacgcctg ccgcgaactt ccatgaacgt accgcgagcc aggcgaacgc cggcgtcgaa 960

atcctctga 969



<210> 206 
<211> 322 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 206 



Met Thr Thr Val Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Glu Ala Thr Val Asp Lys Val Val Arg Lys Ile Arg Glu Leu
20 25 30

Gly Gln Lys Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ala Ala Val Gln Thr Gly Ile Glu Leu Leu Ser
50 55 60

Gly Lys Glu His Met Arg Leu Leu Glu Gln Ala Val Thr Val Pro Ser
65 70 75 80

Pro Ala Thr Asp Ala Ile Ala Gln Ala Ala Arg Glu Ala Asn Met Val
85 90 95

Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Thr
100 105 110

Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Val Gln Arg Arg Arg Lys
115 120 125

Ile Thr Pro Thr His Phe Glu Arg Met Val Trp Gly Gln Gly Asp Gly
130 135 140

Ser Gly Leu Arg Ala Ala Asp Thr Lys Val Gly Arg Ile Gly Gln Leu
145 150 155 160

Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met Met Ala
165 170 175

Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Ala Phe Gly
180 185 190

Glu Gly Phe Ala Gln Arg Met Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205

Glu Ser Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220

Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Ser Ile Gly Pro Ile
225 230 235 240

Ser Gly Gly Cys Phe Thr Thr Ile Val Thr Pro Glu Gly Met Leu Ile
245 250 255

Gly Glu Pro Leu Arg Glu Gly Glu Gly Glu Ile Ile Ala Asp Leu Asp
260 265 270

Phe Ser Met Ile Asp Arg Arg Lys Leu Leu Met Asp Ser Val Gly His
275 280 285

Tyr Asn Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Ala
290 295 300

Ala Asn Phe His Glu Arg Thr Ala Ser Gln Ala Asn Ala Gly Val Glu
305 310 315 320

Ile Leu

<210> 207 
<211> 966 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 207 

atgtcaaacg agaacaacaa cgctacattc aaagttgccg cagtacaggc tacacctgtt 60

tttctcgatc gtgaagcgac tctcgacaag gcttgcgatt tgatcgccgc cgccggaggt 120

gaaggggcac gattggttgt ctttccagaa gccttcatac cggcctatcc ggattgggta 180

tgggcaatcc caccgggtga agagggcgta cttaatgagt tgtacgcaga gctgctctcc 240

aactcggtca cgattcccag tgacgcgacg gacagactgt gccgggccgc gaggcttgct 300

aatgcttacg tggtgatggg gataagcgaa cgcaatgtcg aggcgagtgg agcaagcctg 360

tataacacgc tgttgtacat cgatgcgcag ggtgagattc taggcaaaca tcgaaagcta 420

gtgccaacgg gcggcgagcg gctggtgtgg gcgcagggcg atggcagcac actgcaggtc 480

tacgatactc cactgggaaa actcggcggt ttaatttgct gggagaatta tatgccgctg 540

gcccgctata ccatgtatgc ctggggcaca caaatctatg tcgccgctac gtgggatcgc 600

gggcaaccct ggctctccac tttgcggcat atcgccaaag aaggcagggt gtacgtgatt 660

ggttgttgta tcgcgatgcg caaagacgat atccctgatc gttacgcaat gaagcagaag 720

ttttacgcgg aggcagatga gtggatcaat ataggtgaca gcgcgattgt caatcctgaa 780

gggcaattta tcgcagggcc agtacgcaag caggaagaga ttctctacgc agagattgat 840

ccgcgcatgg tacaagggcc gaagtggatg ctcgacgtgg cggggcacta tgccaggccg 900

gatgtgttcc agttgacggt gcatacggat gtgcgacaga tgattcggat ggaacacgat 960

tcttaa 966



<210> 208 
<211> 321 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 208 

Met Ser Asn Glu Asn Asn Asn Ala Thr Phe Lys Val Ala Ala Val Gln
1 5 10 15

Ala Thr Pro Val Phe Leu Asp Arg Glu Ala Thr Leu Asp Lys Ala Cys
20 25 30

Asp Leu Ile Ala Ala Ala Gly Gly Glu Gly Ala Arg Leu Val Val Phe
35 40 45

Pro Glu Ala Phe Ile Pro Ala Tyr Pro Asp Trp Val Trp Ala Ile Pro
50 55 60

Pro Gly Glu Glu Gly Val Leu Asn Glu Leu Tyr Ala Glu Leu Leu Ser
65 70 75 80

Asn Ser Val Thr Ile Pro Ser Asp Ala Thr Asp Arg Leu Cys Arg Ala
85 90 95

Ala Arg Leu Ala Asn Ala Tyr Val Val Met Gly Ile Ser Glu Arg Asn
100 105 110

Val Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Ile Asp
115 120 125

Ala Gln Gly Glu Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly
130 135 140

Gly Glu Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln Val
145 150 155 160

Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn
165 170 175

Tyr Met Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gln Ile
180 185 190

Tyr Val Ala Ala Thr Trp Asp Arg Gly Gln Pro Trp Leu Ser Thr Leu
195 200 205

Arg His Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile
210 215 220

Ala Met Arg Lys Asp Asp Ile Pro Asp Arg Tyr Ala Met Lys Gln Lys
225 230 235 240

Phe Tyr Ala Glu Ala Asp Glu Trp Ile Asn Ile Gly Asp Ser Ala Ile
245 250 255

Val Asn Pro Glu Gly Gln Phe Ile Ala Gly Pro Val Arg Lys Gln Glu
260 265 270

Glu Ile Leu Tyr Ala Glu Ile Asp Pro Arg Met Val Gln Gly Pro Lys
275 280 285

Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gln
290 295 300

Leu Thr Val His Thr Asp Val Arg Gln Met Ile Arg Met Glu His Asp
305 310 315 320

Ser



<210> 209 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 209 

atgaaagtcg tcaaagccgc cgctgtccag atcagtcccg ttctctacag ccgtgaggca 60

accgtcgaaa aggtcgtgaa gaaaattcac gagcttggcc aactgggcgt gcagttcgcc 120

acctttcccg agaccgtagt gccttactac ccgtactttt ccgccgtcca gacaggcatt 180

gagcttctgt ccggaactga gcatctgcgg ctgctcgatc aggccgtgac ggtaccgtct 240

cccgctaccg atgcgatcgg agaggcggcc cgcaaggcgg gcatggtggt gtccatcggc 300

gtgaatgaac gcgacggcgg caccttgtac aacacacagt tgctcttcga tgccgatggc 360

accttgatcc agcgccgccg caagatcacg cccacccact tcgaacggat gatctggggc 420

cagggggacg gctcgggcct gcgcgccgtc gacagcaagg ttggtcgcat tggtcagctt 480

gcctgcttcg agcacaacaa cccgctggcc cgctacgcgc tgattgccga cggcgagcag 540

atccattccg ccatgtatcc gggttctgct ttcggcgaag gctttgccca aaggatggaa 600

atcaatatcc gccagcatgc gctggagtct ggtgcctttg tcgtcaacgc aacggcctgg 660

ctggatgctg accagcaggc gcaaatcatc aaggacaccg gctgtgggat tggcccgatc 720

tcgggcggct gcttcaccac gatcgtggca cccgacggca tgctgatggc cgaacctctg 780

cgttcgggcg agggtgaggt catcgtggat ctcgacttca cgctgatcga ccgacgcaag 840

atgttgatgg actcggcggg ccactataac cgtccagaac tgctcagtct catgattgac 900

cgtaccgcga cggcgcatgt tcacgaacgc gctgcgcatc cggtgtcggg cgcggagcag 960

ggtccggagg atctgcgcac tccggccgcg tga 993

<210> 210 
<211> 330 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 210 
Met Lys Val Val Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Glu Ala Thr Val Glu Lys Val Val Lys Lys Ile His Glu Leu
20 25 30

Gly Gln Leu Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Ala Val Gln Thr Gly Ile Glu Leu Leu Ser
50 55 60

Gly Thr Glu His Leu Arg Leu Leu Asp Gln Ala Val Thr Val Pro Ser
65 70 75 80

Pro Ala Thr Asp Ala Ile Gly Glu Ala Ala Arg Lys Ala Gly Met Val
85 90 95

Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr
100 105 110

Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125

Ile Thr Pro Thr His Phe Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140

Ser Gly Leu Arg Ala Val Asp Ser Lys Val Gly Arg Ile Gly Gln Leu
145 150 155 160

Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Leu Ile Ala
165 170 175

Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Ala Phe Gly
180 185 190

Glu Gly Phe Ala Gln Arg Met Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205

Glu Ser Gly Ala Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220

Gln Gln Ala Gln Ile Ile Lys Asp Thr Gly Cys Gly Ile Gly Pro Ile
225 230 235 240

Ser Gly Gly Cys Phe Thr Thr Ile Val Ala Pro Asp Gly Met Leu Met
245 250 255

Ala Glu Pro Leu Arg Ser Gly Glu Gly Glu Val Ile Val Asp Leu Asp
260 265 270

Phe Thr Leu Ile Asp Arg Arg Lys Met Leu Met Asp Ser Ala Gly His
275 280 285

Tyr Asn Arg Pro Glu Leu Leu Ser Leu Met Ile Asp Arg Thr Ala Thr
290 295 300

Ala His Val His Glu Arg Ala Ala His Pro Val Ser Gly Ala Glu Gln
305 310 315 320

Gly Pro Glu Asp Leu Arg Thr Pro Ala Ala
325 330

<210> 211
<211> 1062
<212> DNA
<213> Unknown

<220>

<223> Obtained from an environmental sample



<400> 211 

atgggcgtcg cacatccgaa atacaaagtg gccgccgtgc aggcagcgcc cgcttttctc 60

gacctggacg cctcggtcga aaaggccgtc cgtttcatcg acgaagccgg cgccgccggc 120

gcccgcctca tcgcctttcc ggagacctgg atacccggtt acccctggtg gatctggcta 180

ggcgcgccgg cctgggctat catgcgcggc ttcgtctcgc gctatttcga caactcgctc 240

agctacgaca gcccgcaggc cgagaagctc cgcgccgccg ccaagcgcaa caagatggtg 300

gtggtgctcg gcctctccga gcgcgacggc ggcagccttt acatcgcgca atggatcatc 360

ggcccggacg gcgaaaccat cgccaagcgc cgcaagctca agccgaccca cgcggagcgg 420

accgtgttcg gcgaaggcga cggctcgcat cttgcggtgc acgagcttga tgttggccgg 480

ctcggcgcgc tgtgctgctg ggaacacctg cagccgctgt ccaaatacgc catgtatgcg 540

cagaacgaac aggtgcatgt cgcggcctgg ccgagctttt cgctttacga tccgttcgcg 600

cacgcgctcg gcgcggaagt gaacaatgcg gcgagcaaaa tctatgcggt cgagggctcg 660

tgtttcgtca tcgcgccgtg cgcgaccgtt tcgcaggcga tgatcgacga actctgcgat 720

acgccggaga agcatcagtt cctgcatgcc ggcggcggct ttgccgtgat ttacggcccc 780

gacggcgccc cgctcgcggc gccgctgccg cccgacaagg aaggcttgct ctacgccgac 840

atcgatctcg ggatgatttc ggttgccaaa gcggcagccg atccggccgg gcattatgca 900

cgccccgacg tcacccggct tctgttcaac aatcggcctg ggtatcgggt cgagaccatg 960

gcgttgccga tcgatgcgga gaccaaggcg gaagcaccgg ctaagccgga acccaaggca 1020

ccgaacgtgg cgccgttcgc gccggtgcaa gcggccgagt ga 1062



<210> 212 
<211> 353 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 212 



Met Gly Val Ala His Pro Lys Tyr Lys Val Ala Ala Val Gln Ala Ala
1 5 10 15

Pro Ala Phe Leu Asp Leu Asp Ala Ser Val Glu Lys Ala Val Arg Phe
20 25 30

Ile Asp Glu Ala Gly Ala Ala Gly Ala Arg Leu Ile Ala Phe Pro Glu
35 40 45

Thr Trp Ile Pro Gly Tyr Pro Trp Trp Ile Trp Leu Gly Ala Pro Ala
50 55 60

Trp Ala Ile Met Arg Gly Phe Val Ser Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80

Ser Tyr Asp Ser Pro Gln Ala Glu Lys Leu Arg Ala Ala Ala Lys Arg
85 90 95

Asn Lys Met Val Val Val Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110

Leu Tyr Ile Ala Gln Trp Ile Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125

Lys Arg Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Val Phe Gly
130 135 140

Glu Gly Asp Gly Ser His Leu Ala Val His Glu Leu Asp Val Gly Arg
145 150 155 160

Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr
165 170 175

Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro Ser
180 185 190

Phe Ser Leu Tyr Asp Pro Phe Ala His Ala Leu Gly Ala Glu Val Asn
195 200 205

Asn Ala Ala Ser Lys Ile Tyr Ala Val Glu Gly Ser Cys Phe Val Ile
210 215 220

Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys Asp
225 230 235 240

Thr Pro Glu Lys His Gln Phe Leu His Ala Gly Gly Gly Phe Ala Val
245 250 255

Ile Tyr Gly Pro Asp Gly Ala Pro Leu Ala Ala Pro Leu Pro Pro Asp
260 265 270

Lys Glu Gly Leu Leu Tyr Ala Asp Ile Asp Leu Gly Met Ile Ser Val
275 280 285

Ala Lys Ala Ala Ala Asp Pro Ala Gly His Tyr Ala Arg Pro Asp Val
290 295 300

Thr Arg Leu Leu Phe Asn Asn Arg Pro Gly Tyr Arg Val Glu Thr Met
305 310 315 320

Ala Leu Pro Ile Asp Ala Glu Thr Lys Ala Glu Ala Pro Ala Lys Pro
325 330 335

Glu Pro Lys Ala Pro Asn Val Ala Pro Phe Ala Pro Val Gln Ala Ala
340 345 350

Glu



<210> 213 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 213 

atgagagttg ttaaagccgc tgctgtccaa ctgagtcccg tcctctatag tcgcgaggga 60

acggtcgaaa aggtcgtgcg gaagatccat gaacttgccg aagagggagt cgagttcgcc 120

acctttcctg agaccgtggt gccttactac ccgtactttt ccttcgttca gacgcccttg 180

gagcaaatct tcggaacaga gtatctgagg ctgctcgacc aggcagtcac cgtgccatcc 240

cctgccaccg acgcgatcgg cgaggcagcc aggttcgctg gagttgttgt ctcgatcggc 300

gtcaacgagc gagacggggg aactctatac aacactcagc ttctcttcga tgccgacggc 360

aggataattc agcggcgccg caagatcacg cccacccatt acgagcgcat gatctggggc 420

cagggcgacg gctcaggtct gcgggccgtt gatagcaagg ccggccgtat tggtcagctg 480

gcatgctggg agcacaacaa tccactggcg cgctacgcgc tgatggccga cggcgagcag 540

atccattccg ccatgtatcc gggctccatg ttcggcgact cgtttgccca gaagaccgaa 600

atcaatatcc ggcagcatgc cctagagtct gggtgcttcg tcgtgaacgc aacggcctgg 660

ctggacggcg atcagcaggc gcatatcatg aaggacaccg gctgcagcat cggcccgatc 720

tccggcggtt gcttcactgc gatcgtcgca cccgatggta gcctgctggg cgaacccatc 780

cgttccggtg agggcgtggt catcgccgac ctcgacttca cgttgatcga caggcgtaag 840

caggtgatgg actcgcgagg ccattacagc cggccggagt tgctcagcct cttaatagac 900

cgcaccccta ccgcgcactt tcacgaacgc gcttcgcccc ccacgacaga agctgagcaa 960

ggctccgagg atgtgttcga ggctcgcatt taa 993



<210> 214 
<211> 330 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
<400> 214 

Met Arg Val Val Lys Ala Ala Ala Val Gln Leu Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Glu Gly Thr Val Glu Lys Val Val Arg Lys Ile His Glu Leu
20 25 30

Ala Glu Glu Gly Val Glu Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Thr Pro Leu Glu Gln Ile Phe
50 55 60

Gly Thr Glu Tyr Leu Arg Leu Leu Asp Gln Ala Val Thr Val Pro Ser
65 70 75 80

Pro Ala Thr Asp Ala Ile Gly Glu Ala Ala Arg Phe Ala Gly Val Val
85 90 95

Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr
100 105 110

Gln Leu Leu Phe Asp Ala Asp Gly Arg Ile Ile Gln Arg Arg Arg Lys
115 120 125

Ile Thr Pro Thr His Tyr Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140

Ser Gly Leu Arg Ala Val Asp Ser Lys Ala Gly Arg Ile Gly Gln Leu
145 150 155 160

Ala Cys Trp Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala
165 170 175

Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Met Phe Gly
180 185 190

Asp Ser Phe Ala Gln Lys Thr Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205

Glu Ser Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Gly Asp
210 215 220

Gln Gln Ala His Ile Met Lys Asp Thr Gly Cys Ser Ile Gly Pro Ile
225 230 235 240

Ser Gly Gly Cys Phe Thr Ala Ile Val Ala Pro Asp Gly Ser Leu Leu
245 250 255

Gly Glu Pro Ile Arg Ser Gly Glu Gly Val Val Ile Ala Asp Leu Asp
260 265 270

Phe Thr Leu Ile Asp Arg Arg Lys Gln Val Met Asp Ser Arg Gly His
275 280 285

Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Thr
290 295 300

Ala His Phe His Glu Arg Ala Ser Pro Pro Thr Thr Glu Ala Glu Gln
305 310 315 320

Gly Ser Glu Asp Val Phe Glu Ala Arg Ile
325 330



<210> 215 
<211> 1008 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 215

atgagtatta cccaccccaa atttaaagct gctgttgttc aagctgcccc agtatttcta 60

gacctagatg ggtctgttaa taaggcgatt aatctcattg atgaagctgc cgctgccgga 120

gccaagctca ttgccttccc tgaaaccttc attccaggct atccatggtg gatttggctg 180

ggatcgccgg cgtgggctct gggccggggg ttcgttcagc gttacttcga caattccctg 240

cagtacgaca gcccgcaggc ggatcgctta cgcgaggcgg cacgacgcaa cagcattacg 300

gtcgtgctgg gcttgtccga gcgtgatggc ggttctctct atatcgcaca gtggctgatc 360

ggcccggatg gcgaaaccat cgcgcagcgg cgcaagcttc gtcctactca tggggagcgc 420

acggtattcg gtgaagggga tggcagcgat ctggtggttc atcaaaccga actgggccgt 480

cttggcgcgc ttaactgctg ggagaacatc ctgtctctga acaaatatgt gatgtactcc 540

cagcatgaac aggtccatgt agcatcctgg cccagtttct cgacgtatga accgttcgcg 600

catgcgctcg gctatgaggt aaacaacgca attagccagg tctatgcggt ggaaggcggg 660

tgcttcgtgt tggccccgtg ctctaccatc tctgaagaaa tgattgccga actgtgcgat 720

acacccgata aattcgagct gacgcatgct ggtggcggcc acgcaatcat ctatggtccg 780

gacggtcgtg ctctgtgcga aaagctgccc gagaaccagg agggcctgct gtacgcggaa 840

atcgatctgg gggtgatttc tatggcaaaa agtgccatgg atcctgtcgg ccattactct 900

cgccccgatg tctaccgtgt gctgttcaat aagatcccgg caaagcgtat cgagcacttc 960

aatttgccgt tggatgagca agcaggggaa gagccaccag ctgattaa 1008

<210> 216
<211> 335 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 216 

Met Ser Ile Thr His Pro Lys Phe Lys Ala Ala Val Val Gln Ala Ala
1 5 10 15

Pro Val Phe Leu Asp Leu Asp Gly Ser Val Asn Lys Ala Ile Asn Leu
20 25 30

Ile Asp Glu Ala Ala Ala Ala Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45

Thr Phe Ile Pro Gly Tyr Pro Trp Trp Ile Trp Leu Gly Ser Pro Ala
50 55 60

Trp Ala Leu Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80

Gln Tyr Asp Ser Pro Gln Ala Asp Arg Leu Arg Glu Ala Ala Arg Arg
85 90 95

Asn Ser Ile Thr Val Val Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110

Leu Tyr Ile Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125

Gln Arg Arg Lys Leu Arg Pro Thr His Gly Glu Arg Thr Val Phe Gly
130 135 140

Glu Gly Asp Gly Ser Asp Leu Val Val His Gln Thr Glu Leu Gly Arg
145 150 155 160

Leu Gly Ala Leu Asn Cys Trp Glu Asn Ile Leu Ser Leu Asn Lys Tyr
165 170 175

Val Met Tyr Ser Gln His Glu Gln Val His Val Ala Ser Trp Pro Ser
180 185 190

Phe Ser Thr Tyr Glu Pro Phe Ala His Ala Leu Gly Tyr Glu Val Asn
195 200 205

Asn Ala Ile Ser Gln Val Tyr Ala Val Glu Gly Gly Cys Phe Val Leu
210 215 220

Ala Pro Cys Ser Thr Ile Ser Glu Glu Met Ile Ala Glu Leu Cys Asp
225 230 235 240

Thr Pro Asp Lys Phe Glu Leu Thr His Ala Gly Gly Gly His Ala Ile
245 250 255

Ile Tyr Gly Pro Asp Gly Arg Ala Leu Cys Glu Lys Leu Pro Glu Asn
260 265 270

Gln Glu Gly Leu Leu Tyr Ala Glu Ile Asp Leu Gly Val Ile Ser Met
275 280 285

Ala Lys Ser Ala Met Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val
290 295 300

Tyr Arg Val Leu Phe Asn Lys Ile Pro Ala Lys Arg Ile Glu His Phe
305 310 315 320

Asn Leu Pro Leu Asp Glu Gln Ala Gly Glu Glu Pro Pro Ala Asp
325 330 335

<210> 217
<211> 1011
<212> DNA
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 217

gtgggtatca gccatccgaa attcaaggct gcggtcgtac aggccgggcc tgccttcctc 60

gacctcgacg gcggcgtcga acgagccgtg tcgctcatcg gccaagcagc ggccgaaggg 120

gcacagctga ttgcctttcc tgaaacgtgg attcccggtt acccgtggca cacctggctt 180

ggcagcccgg cgtgggcaat ggaaaaaggc tttgtccaac gatatttcga caacgcgttg 240

cggcatggtt ctccgcaagc cgagcgaatc tccggggctg cggcggagca caagattatg 300

gtgtcgcttg ggtttgcgga acgcgatgga ggcacgcttt atatcgcgca gtggctcatc 360

ggacccgacg gccaaactat ctcacgacgg cggaagctta agccgactca cgtcgagcgc 420

actgtatttg gcgagggaga cggaagcgat ctctccgtgc atgatacggc gcttggacgt 480

atcggctcac tttgttgctg ggagcatttg caaccgttgt cgaaatacgc aatgtacgcc 540

cagaatgaac agattcacat tggcgcatgg cccagctttt cgctatacca gccatttgcg 600

aatgcgctga gtcccgaagt caatatcgca gtaagccgcg tgtacgccgt ggaaggccag 660

tgtttcttcc tcgcgccgtg cgcgacggtt tcggacgcca tgatcgaaac actgtgcgat 720

acgcccgaaa agcagggact gattcgggcg ggtggcgggc acgccgcgat cttcggccca 780

gatggaagtc tgctgacgcc tacggtagcg gatacttacg agggcctgct gtatgcagaa 840

ctcgacctcg gcgtcatttc gatcgccaag agtgcagcgg accccgccgg ccactattcg 900

cggccagatg tcacacgcct tctattgaat cagacgcctt cgaagcgcgt tcagaatatg 960

gtgttaccac tggagacggt cacggagccc gaaggcccgg ttcagcccta g 1011

<210> 218
<211> 336
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 218

Val Gly Ile Ser His Pro Lys Phe Lys Ala Ala Val Val Gln Ala Gly
1 5 10 15

Pro Ala Phe Leu Asp Leu Asp Gly Gly Val Glu Arg Ala Val Ser Leu
20 25 30

Ile Gly Gln Ala Ala Ala Glu Gly Ala Gln Leu Ile Ala Phe Pro Glu
35 40 45

Thr Trp Ile Pro Gly Tyr Pro Trp His Thr Trp Leu Gly Ser Pro Ala
50 55 60

Trp Ala Met Glu Lys Gly Phe Val Gln Arg Tyr Phe Asp Asn Ala Leu
65 70 75 80

Arg His Gly Ser Pro Gln Ala Glu Arg Ile Ser Gly Ala Ala Ala Glu
85 90 95

His Lys Ile Met Val Ser Leu Gly Phe Ala Glu Arg Asp Gly Gly Thr
100 105 110

Leu Tyr Ile Ala Gln Trp Leu Ile Gly Pro Asp Gly Gln Thr Ile Ser
115 120 125

Arg Arg Arg Lys Leu Lys Pro Thr His Val Glu Arg Thr Val Phe Gly
130 135 140

Glu Gly Asp Gly Ser Asp Leu Ser Val His Asp Thr Ala Leu Gly Arg
145 150 155 160

Ile Gly Ser Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr
165 170 175

Ala Met Tyr Ala Gln Asn Glu Gln Ile His Ile Gly Ala Trp Pro Ser
180 185 190

Phe Ser Leu Tyr Gln Pro Phe Ala Asn Ala Leu Ser Pro Glu Val Asn
195 200 205

Ile Ala Val Ser Arg Val Tyr Ala Val Glu Gly Gln Cys Phe Phe Leu
210 215 220

Ala Pro Cys Ala Thr Val Ser Asp Ala Met Ile Glu Thr Leu Cys Asp
225 230 235 240

Thr Pro Glu Lys Gln Gly Leu Ile Arg Ala Gly Gly Gly His Ala Ala
245 250 255

Ile Phe Gly Pro Asp Gly Ser Leu Leu Thr Pro Thr Val Ala Asp Thr
260 265 270

Tyr Glu Gly Leu Leu Tyr Ala Glu Leu Asp Leu Gly Val Ile Ser Ile
275 280 285

Ala Lys Ser Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val
290 295 300

Thr Arg Leu Leu Leu Asn Gln Thr Pro Ser Lys Arg Val Gln Asn Met
305 310 315 320

Val Leu Pro Leu Glu Thr Val Thr Glu Pro Glu Gly Pro Val Gln Pro
325 330 335
325 330 335 

<210> 219 
<211> 996 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 219

atgaaaatcg tcaaagctgc cgccgttcaa attagtcccg tgctttatag ccgcgatgga 60

acggtggaaa aagttgtgcg gaagatccat gagctcggcc aacagggagt gcagttcgca 120

accttccccg agaccgtgat tccgtactat ccctattttt cctttctgca acccgcctac 180

cagatcgcgg ccgggcagga gcacctcaag ttgctcgacc aggcggtgac ggtgccgtcc 240

gctgccacgc acgcgatcgg ccaggcatgc aaacaggcag gagtggttgt ttcgatcggc 300

atcaatgagc gggacaatgg aacgatctac aacacccagc ttctcttcga ctccgacgga 360

accctgctgc agcggcgccg caagatctca cccacttacc acgaacggat gatctggggc 420

cagggcgacg gttccggcct gcgcgccgtg gatagcaagg ccggccggat tgggcagctc 480

gcctgttggg agcactacaa ccctttggcg cgctacgcga tgatggccga cggcgagcag 540

atccattccg ccatgtaccc tggctccatc ttcggcgatc tgtttgccga acagacccag 600

atcaacgtcc ggcagcacgc gttggagtcc ggctgtttcg tagtctgctc caccgcctgg 660

ttggatcccg atcaacaagc gcaaatcgtc aaggacacag gcggcgccat tggtgcgatc 720

tcgggcggct gtttcacggc gattgtggcc cccgatagca ctctgctcgg agagccgatc 780

cgctccggcg aaggcgtggt cattgcggat ctcgacttca cccggatcga caagcgtaaa 840

cagttgatgg attcccgagg tcattacagc cggccggaat tgctgagtct gctgattgat 900

cgcacttcca ccgcacatgt gcaaaaccgc gtgccggccg atccattcgg cgccggctgt 960

gttgcagaac caggagtaga aacatgccca cgatag 996

<210> 220
<211> 331
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 220

Met Lys Ile Val Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Asp Gly Thr Val Glu Lys Val Val Arg Lys Ile His Glu Leu
20 25 30

Gly Gln Gln Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Ile Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Phe Leu Gln Pro Ala Tyr Gln Ile Ala Ala
50 55 60

Gly Gln Glu His Leu Lys Leu Leu Asp Gln Ala Val Thr Val Pro Ser
65 70 75 80

Ala Ala Thr His Ala Ile Gly Gln Ala Cys Lys Gln Ala Gly Val Val
85 90 95

Val Ser Ile Gly Ile Asn Glu Arg Asp Asn Gly Thr Ile Tyr Asn Thr
100 105 110

Gln Leu Leu Phe Asp Ser Asp Gly Thr Leu Leu Gln Arg Arg Arg Lys
115 120 125

Ile Ser Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140

Ser Gly Leu Arg Ala Val Asp Ser Lys Ala Gly Arg Ile Gly Gln Leu
145 150 155 160

Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Met Met Ala
165 170 175

Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Ile Phe Gly
180 185 190

Asp Leu Phe Ala Glu Gln Thr Gln Ile Asn Val Arg Gln His Ala Leu
195 200 205

Glu Ser Gly Cys Phe Val Val Cys Ser Thr Ala Trp Leu Asp Pro Asp
210 215 220

Gln Gln Ala Gln Ile Val Lys Asp Thr Gly Gly Ala Ile Gly Ala Ile
225 230 235 240


Ser 


Gly 


Gly 


Cys 


Phe 


Thr 


Ala 


He 


Val 


Ala 


Pro 


Asp 


Ser 


Thr 


Leu 


Leu 






245 










250 










255 




Gly 


Glu 


Pro 


He 


Arg 


Ser 


Gly 


Glu 


Gly 


Val 


Val 


He 


Ala 


Asp 


Leu 


Asp 






260 










265 










270 






Phe 


Thr 


Arg 


He 


Asp 


Lys 


Arg 


Lys 


Gin 


Leu 


Met 


Asp 


Ser 


Arg 


Gly 


His 






275 










280 










285 








Tyr 


Ser 


Arg 


Pro 


Glu 


Leu 


Leu 


Ser 


Leu 


Leu 


He 


Asp 


Arg 


Thr 


Ser 


Thr 


290 








295 










300 










Ala 


His 


Val 


Gin 


Asn 


Arg 


Val 


Pro 


Ala 


Asp 


Pro 


Phe 


Gly 


Ala 


Gly 


Cys 


305 










310 










315 










320 


Val 


Ala 


Glu 


Pro 


Gly Val 


Glu 


Thr 


Cys 


Pro 


Arg 













325 330 



<210> 221 
<211> 1146 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 221 

atgttgacct acaaaggcgt gttcaaagca gcgacagttc aggcggagcc ggtgtggatg 60
gacgctgacg ccaccattac caaagccatc cggatcatcg aagaggcggc tgataacggg 120
gcgaagtttg tcgcattccc ggaggtgttc atccctggat acccttggtg gatctggctg 180
ggcactgcga tgtggggagc caaattcgtc gtgcccttcc acgaaaattg ccttgagctt 240
ggcgacaagc ggatgcagcg cattcaagct gcggcaaaac aaaatggcat tgcactcgtc 300
atgggttacg gtgaacgtga cggaggtagc cgctacatga gccaggtatt catcgacgat 360
agcggaaaaa ttgtggcgaa ccgccgcaag ctcaaaccga cgcacgaaga gcgaacgatt 420
ttcggcgagg gaaacggatc cgatttcatc acccatgact tcccatttgc gagagtcgga 480
ggcttcaact gctgggaaca ccttcagccc ctgagcaaat acatgatgta cagcctgcag 540
gaacaggtgc atgtcgcgtc gtggccggcc atgtgcacct atcagcctga cgtgccccag 600
ttaggtgcag gcgctaacga ggcggtcacc cgctcttacg cgatcgaagg cgcatgttat 660
gtgctgggag cgacactcgt tatcggcaag gcagcccatg atgcattctg cgacactgaa 720
gagcatcaca agctgcttgg catgggcgga gggtgggccc gaattttcgg cccggacggt 780
gagtatctcg cagaatcgct cgctcatgac gcggagggca tcctgtacgc tgacattgat 840
ctgagcaaga tcctgctcgc caaagcaaac acggacacag ttggccacta cgcgcgacct 900
gacgtcttgt cattgctcgt caacacacac aatcccgggc ctgtacgcta tcttgacgaa 960
gagggccggc aggtttcaac gtcaattcgc aggcacgaaa aactggaagg ccagagcctc 1020
gacctcgagg ttacgccagc gacaccagcg accctggaca tcgcgagcct ggtacaacag 1080
gctaagccat cgaccgttaa gagcgagagt aacgcgagca caaaacaacc ggatctcgcg 1140
gtctga 1146

<210> 222
<211> 381
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 222

Met Leu Thr Tyr Lys Gly Val Phe Lys Ala Ala Thr Val Gln Ala Glu
1 5 10 15

Pro Val Trp Met Asp Ala Asp Ala Thr Ile Thr Lys Ala Ile Arg Ile
20 25 30

Ile Glu Glu Ala Ala Asp Asn Gly Ala Lys Phe Val Ala Phe Pro Glu
35 40 45

Val Phe Ile Pro Gly Tyr Pro Trp Trp Ile Trp Leu Gly Thr Ala Met
50 55 60

Trp Gly Ala Lys Phe Val Val Pro Phe His Glu Asn Cys Leu Glu Leu
65 70 75 80

Gly Asp Lys Arg Met Gln Arg Ile Gln Ala Ala Ala Lys Gln Asn Gly
85 90 95

Ile Ala Leu Val Met Gly Tyr Gly Glu Arg Asp Gly Gly Ser Arg Tyr
100 105 110

Met Ser Gln Val Phe Ile Asp Asp Ser Gly Lys Ile Val Ala Asn Arg
115 120 125

Arg Lys Leu Lys Pro Thr His Glu Glu Arg Thr Ile Phe Gly Glu Gly
130 135 140

Asn Gly Ser Asp Phe Ile Thr His Asp Phe Pro Phe Ala Arg Val Gly
145 150 155 160

Gly Phe Asn Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr Met Met
165 170 175

Tyr Ser Leu Gln Glu Gln Val His Val Ala Ser Trp Pro Ala Met Cys
180 185 190

Thr Tyr Gln Pro Asp Val Pro Gln Leu Gly Ala Gly Ala Asn Glu Ala
195 200 205

Val Thr Arg Ser Tyr Ala Ile Glu Gly Ala Cys Tyr Val Leu Gly Ala
210 215 220

Thr Leu Val Ile Gly Lys Ala Ala His Asp Ala Phe Cys Asp Thr Glu
225 230 235 240

Glu His His Lys Leu Leu Gly Met Gly Gly Gly Trp Ala Arg Ile Phe
245 250 255

Gly Pro Asp Gly Glu Tyr Leu Ala Glu Ser Leu Ala His Asp Ala Glu
260 265 270

Gly Ile Leu Tyr Ala Asp Ile Asp Leu Ser Lys Ile Leu Leu Ala Lys
275 280 285

Ala Asn Thr Asp Thr Val Gly His Tyr Ala Arg Pro Asp Val Leu Ser
290 295 300

Leu Leu Val Asn Thr His Asn Pro Gly Pro Val Arg Tyr Leu Asp Glu
305 310 315 320

Glu Gly Arg Gln Val Ser Thr Ser Ile Arg Arg His Glu Lys Leu Glu
325 330 335

Gly Gln Ser Leu Asp Leu Glu Val Thr Pro Ala Thr Pro Ala Thr Leu
340 345 350

Asp Ile Ala Ser Leu Val Gln Gln Ala Lys Pro Ser Thr Val Lys Ser
355 360 365

Glu Ser Asn Ala Ser Thr Lys Gln Pro Asp Leu Ala Val
370 375 380



<210> 223 
<211> 996 
<212> DNA 
<213> Unknown 



<220> 
<223> Obtained from an environmental sample



<400> 223 

atgagagttg ttaaagccag cgcagttcaa ctgaagcctg tactttatag ccgcgagggg 60
acggtcgaaa aggtcgtagc gaagattcat gagctagggc agcagggtgt gcagttcgcc 120
gcgttccctg agaccgtggt gccttattac ccgtactttt cgatcgtgca gtccggctat 180
caaatccttc gcggcggtga gttcgtcaag ctcctcgatc agtccgtgac ggtgccatct 240
tatagcactg aagccatcgg cgaagcctgc aggcaggagg agatggttgt ctccataggc 300
gtcaacgagc gtgatggcgg aacgatctac aacgcgcagt tactctttga ttccgacggc 360
acgttgatcc aaagacgacg caagatcacc cccacccatt acgaacgcat gatctggggc 420
cagggcgatg gctcaggtct gcgcgctgtt gacagcaatg tcgcacgcat tggtcaactg 480
gcatgctttg agcactacaa ccctcttgcg cggtacgcga tgatggccga tggcgagcag 540
atccattccg ccatgttccc cggttccatg ttcggcgatg gttttgcgga gaggacggaa 600
attgccgtaa ggcaacatgc gatggagtcc gggtgctttg tcgtttgcgc tacggcctgg 660
ctcgatcccg gccagcaggc tcagatcgcc aacgacaccg gtatcaccga catcggcccg 720
atctccgggg gttgcttcac tgcgatcatc gcacccgatg ggagcctgct gggccaacct 780
atccgctcgg gtgaaggcga agtcatcgtc gacctcgatt tcacgttaat tgacaagcgg 840
aaacatattg tcgactcgag aggacattac agccggccag aattgctgag cctgctgatc 900
gatcgcactc ccaccgcgca ccttcacgac cgcgctgtgc agcacaatgc cggatcggaa 960
ggagcgtcgg aacatcttcg cgaagacgcc gcctga 996

<210> 224
<211> 331
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 224

Met Arg Val Val Lys Ala Ser Ala Val Gln Leu Lys Pro Val Leu Tyr
1 5 10 15

Ser Arg Glu Gly Thr Val Glu Lys Val Val Ala Lys Ile His Glu Leu
20 25 30

Gly Gln Gln Gly Val Gln Phe Ala Ala Phe Pro Glu Thr Val Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Ile Val Gln Ser Gly Tyr Gln Ile Leu Arg
50 55 60

Gly Gly Glu Phe Val Lys Leu Leu Asp Gln Ser Val Thr Val Pro Ser
65 70 75 80

Tyr Ser Thr Glu Ala Ile Gly Glu Ala Cys Arg Gln Glu Glu Met Val
85 90 95

Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Ala
100 105 110

Gln Leu Leu Phe Asp Ser Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125

Ile Thr Pro Thr His Tyr Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140

Ser Gly Leu Arg Ala Val Asp Ser Asn Val Ala Arg Ile Gly Gln Leu
145 150 155 160

Ala Cys Phe Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Met Met Ala
165 170 175

Asp Gly Glu Gln Ile His Ser Ala Met Phe Pro Gly Ser Met Phe Gly
180 185 190

Asp Gly Phe Ala Glu Arg Thr Glu Ile Ala Val Arg Gln His Ala Met
195 200 205

Glu Ser Gly Cys Phe Val Val Cys Ala Thr Ala Trp Leu Asp Pro Gly
210 215 220

Gln Gln Ala Gln Ile Ala Asn Asp Thr Gly Ile Thr Asp Ile Gly Pro
225 230 235 240

Ile Ser Gly Gly Cys Phe Thr Ala Ile Ile Ala Pro Asp Gly Ser Leu
245 250 255

Leu Gly Gln Pro Ile Arg Ser Gly Glu Gly Glu Val Ile Val Asp Leu
260 265 270

Asp Phe Thr Leu Ile Asp Lys Arg Lys His Ile Val Asp Ser Arg Gly
275 280 285

His Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro
290 295 300

Thr Ala His Leu His Asp Arg Ala Val Gln His Asn Ala Gly Ser Glu
305 310 315 320

Gly Ala Ser Glu His Leu Arg Glu Asp Ala Ala
325 330



<210> 225 
<211> 951 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 225 

atgaccaccg tcaaagccgc cgctgtccag atgagtcccg tgctttacag tcgcgacgat 60
accatcgaga aaatttgtcg acagatcatc gagcttggcc gacagggagt gcagttcgcc 120
acgtttccag agacagttat tccgtattat ccatatttcg ccttcgtgca gcggccttac 180
gaaatgtcgg cccaatatca tcagttgctt gatcaagcgg tgaccgtgcc ttccggttca 240
acgcacgcca ttggggccgc ctgtaaacaa gccggaattg ttgtctcaat cggcgtcaac 300
gagcgggagg gcggcacgct ctatggcaca cagttgctgt ttgatgcaga cggcctcttg 360
atccagcgtc gccgcaagat cactccgacc taccacgagc ggatgatttg gggacagggt 420
gacggttccg ggctgcgagc cgtagatagc gcggtcggtc gaatcggaca gttggcgtgt 480
tgggaacacc acaatccgct ggctcgttat gctttagcgg ccgatggcga acaaattcac 540
gcggcgatgt accctggctc gatcttgggt gaactatttg ccgagcagat tcaggtcaac 600
atccggcagc acgccatgga atctggttgc ttcgtcgtga acgccacggc ctggctaagc 660
gaggaacagc aagcccgaat catgaaggac accggatcat tcgatagccc aatcaccggt 720
ggttgcttta ccgccattgt cgcgcccaac gggcagataa tcggtgaacc gctgcgcatc 780
ggcgaaggcg tcgtgattgc cgatttggac ttcgctttga ttgatgagag gaagcggctg 840
atggactcac gcggcctcta tagccgccct gagttgctaa gcttgttaat cgacagaatg 900
cctacatccc atgtgcatga acgggttgag cgtagcatgg cgatggcatg 951

<210> 226
<211> 316
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 226

Met Thr Thr Val Lys Ala Ala Ala Val Gln Met Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Asp Asp Thr Ile Glu Lys Ile Cys Arg Gln Ile Ile Glu Leu
20 25 30

Gly Arg Gln Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Ile Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ala Phe Val Gln Arg Pro Tyr Glu Met Ser Ala
50 55 60

Gln Tyr His Gln Leu Leu Asp Gln Ala Val Thr Val Pro Ser Gly Ser
65 70 75 80

Thr His Ala Ile Gly Ala Ala Cys Lys Gln Ala Gly Ile Val Val Ser
85 90 95

Ile Gly Val Asn Glu Arg Glu Gly Gly Thr Leu Tyr Gly Thr Gln Leu
100 105 110

Leu Phe Asp Ala Asp Gly Leu Leu Ile Gln Arg Arg Arg Lys Ile Thr
115 120 125

Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp Gly Ser Gly
130 135 140

Leu Arg Ala Val Asp Ser Ala Val Gly Arg Ile Gly Gln Leu Ala Cys
145 150 155 160

Trp Glu His His Asn Pro Leu Ala Arg Tyr Ala Leu Ala Ala Asp Gly
165 170 175

Glu Gln Ile His Ala Ala Met Tyr Pro Gly Ser Ile Leu Gly Glu Leu
180 185 190

Phe Ala Glu Gln Ile Gln Val Asn Ile Arg Gln His Ala Met Glu Ser
195 200 205

Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Ser Glu Glu Gln Gln
210 215 220

Ala Arg Ile Met Lys Asp Thr Gly Ser Phe Asp Ser Pro Ile Thr Gly
225 230 235 240

Gly Cys Phe Thr Ala Ile Val Ala Pro Asn Gly Gln Ile Ile Gly Glu
245 250 255

Pro Leu Arg Ile Gly Glu Gly Val Val Ile Ala Asp Leu Asp Phe Ala
260 265 270

Leu Ile Asp Glu Arg Lys Arg Leu Met Asp Ser Arg Gly Leu Tyr Ser
275 280 285

Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Met Pro Thr Ser His
290 295 300

Val His Glu Arg Val Glu Arg Ser Met Ala Met Ala
305 310 315













<210> 227 
<211> 1035 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 227

atgtccgatc gacgcatcgt tcgcgcggca gccattcaga tcgccccgga tctcgagcgc 60
tgcagtgtaa cgctcgagaa ggtttgctcg gcgatcgacg aagccgcggg caaaggcgct 120
caactgagcg tctttcccga gaccttcgtc ccgtactacc cctatttctc gtttgtccgc 180
cctccggtcg catcgggggc cgatcacatg cggctatacg aggaagcggt ggtggtgccg 240
ggtcccgtga cgcaggctgt ttccgaacgt gcgcgcatgc atgggatggt ggtggtgctc 300
ggcgtcaacg agcgcgacca cgggagcctc tacaacaccc aactgatttt cgattgcgat 360
ggtcggctcg ctctgaaacg ccgcaagatc acgccgacgt ttcacgagcg catgatctgg 420
ggccagggcg acgccagtgg actgaaagtc gcgcgcacgg gtatcgggcg ggtcggcgcg 480
ctcgcgtgct gggagcatta caacccgctc gcgcgctacg cgctgatgac ccagcacgaa 540
gagatccatt gcagtcaatt tccaggctcg ctcgtcggtc cgatcttctc cgagcagatg 600
gacgtgacga tccgccatca cgccctcgaa tcggggtgct tcgtcgtgaa cgcaaccggt 660
tggctcacgg acgcgcagat cgcctcgatc accgatgacc cgaagcttca acgggcgctg 720
cgcggtggct gcaacacggc gatcgtttca ccggaaggcc agcatctggc ggagccgttg 780
cgcgaaggcg aggggatggt ggtcgcggac ctcgacatgt cgctcatcac caagcgcaag 840
cgaatgatgg attcggtcgg ccactatgcg cggccggaac tgttgagcct cgcgatcaac 900
gatcgtcccg cagccccttt cggccggatg tgcgctgccg aagcaatgcg gggagccgac 960
gacgtcgtca cagtaggagc atttcatgag cgccagcgag aacgtgtcgg cgaagagccg 1020
gcaattgatg actga 1035

<210> 228
<211> 344
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 228

Met Ser Asp Arg Arg Ile Val Arg Ala Ala Ala Ile Gln Ile Ala Pro
1 5 10 15

Asp Leu Glu Arg Cys Ser Val Thr Leu Glu Lys Val Cys Ser Ala Ile
20 25 30

Asp Glu Ala Ala Gly Lys Gly Ala Gln Leu Ser Val Phe Pro Glu Thr
35 40 45

Phe Val Pro Tyr Tyr Pro Tyr Phe Ser Phe Val Arg Pro Pro Val Ala
50 55 60

Ser Gly Ala Asp His Met Arg Leu Tyr Glu Glu Ala Val Val Val Pro
65 70 75 80

Gly Pro Val Thr Gln Ala Val Ser Glu Arg Ala Arg Met His Gly Met
85 90 95

Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Ser Leu Tyr Asn
100 105 110

Thr Gln Leu Ile Phe Asp Cys Asp Gly Arg Leu Ala Leu Lys Arg Arg
115 120 125

Lys Ile Thr Pro Thr Phe His Glu Arg Met Ile Trp Gly Gln Gly Asp
130 135 140

Ala Ser Gly Leu Lys Val Ala Arg Thr Gly Ile Gly Arg Val Gly Ala
145 150 155 160

Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met
165 170 175

Thr Gln His Glu Glu Ile His Cys Ser Gln Phe Pro Gly Ser Leu Val
180 185 190

Gly Pro Ile Phe Ser Glu Gln Met Asp Val Thr Ile Arg His His Ala
195 200 205

Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Gly Trp Leu Thr Asp
210 215 220

Ala Gln Ile Ala Ser Ile Thr Asp Asp Pro Lys Leu Gln Arg Ala Leu
225 230 235 240

Arg Gly Gly Cys Asn Thr Ala Ile Val Ser Pro Glu Gly Gln His Leu
245 250 255

Ala Glu Pro Leu Arg Glu Gly Glu Gly Met Val Val Ala Asp Leu Asp
260 265 270

Met Ser Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285

Tyr Ala Arg Pro Glu Leu Leu Ser Leu Ala Ile Asn Asp Arg Pro Ala
290 295 300

Ala Pro Phe Gly Arg Met Cys Ala Ala Glu Ala Met Arg Gly Ala Asp
305 310 315 320

Asp Val Val Thr Val Gly Ala Phe His Glu Arg Gln Arg Glu Arg Val
325 330 335

Gly Glu Glu Pro Ala Ile Asp Asp
340

<210> 229 
<211> 975 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

atgcctgaaa acaaagtcaa agttgccctc gttcagcatc cgcctgtttt tctgaatttg 60
ccgaaaacgt tggaaaaagt agaaggtttg gcgcgagagt ^gccgccaa tgaagcgaaa 120
atccttgtct ttcctgaaac ctggctgacc ggctatccgg tctggcttga tgaagcgccg 180
agagccgcgc tgtgggatta tccgcccgcc aagcgtttgt atcaatacct cacggaaaat 240
tcattgcaga ttccgagcgc tgagtttgaa tctttgcgcg aaatcgctaa gaaaaattcc 300
ctttatttag tcgttggagt tcacgaacga agcggcggaa cgctctacaa tacgataatt 360
tatctcacgc ccgacggcag ttataaaact caccgcaaat tggttccaac ttacacggaa 420
agacttgtct ggggcgcagg cgacggaagc ggtctaaatg ttgtggaaac gccttacggg 480
attctcggag gtttgatttg ctgggaacat tggatgcctc tggctagggc ggcaatgcat 540
tcaaaaaatg aagcgattca cgtttgccaa tttcccacgg ttcacgagcg acatcaaatc 600
gccagccgtc attacgcctt cgaagggcag tgttttgtct tgacttccgg ttgcgcgatg 660
acgaaaacgg atgttttgga aggttttgaa tcgctcgaaa caaacgacca cgaagttttc 720
gggcttttgg attcgataga aaaggaagaa ctgatgcgtg gcggaagcgc gattattgcg 780
cccgatttga gctattcggt cgagccggtt tttgacgaaa aaacgattgt ttacggcgaa 840
ttaaatctcg atttaaccaa gcagggacat ctgtttttgg ataccgacgg acattattcg 900
cgtcccgatg ttttcgagtt gcgcgtcaac gataaagcga accgaaacgt ccgttttgca 960
tccgaaacag tatag 975

<210> 230
<211> 324
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 230

Met Pro Glu Asn Lys Val Lys Val Ala Leu Val Gln His Pro Pro Val
1 5 10 15

Phe Leu Asn Leu Pro Lys Thr Leu Glu Lys Val Glu Gly Leu Ala Arg
20 25 30

Glu Cys Ala Ala Asn Glu Ala Lys Ile Val Val Phe Pro Glu Thr Trp
35 40 45

Leu Thr Gly Tyr Pro Val Trp Leu Asp Glu Ala Pro Arg Ala Ala Leu
50 55 60

Trp Asp Tyr Pro Pro Ala Lys Arg Leu Tyr Gln Tyr Leu Thr Glu Asn
65 70 75 80

Ser Leu Gln Ile Pro Ser Ala Glu Phe Glu Ser Leu Arg Glu Ile Ala
85 90 95

Lys Lys Asn Ser Leu Tyr Leu Val Val Gly Val His Glu Arg Ser Gly
100 105 110

Gly Thr Leu Tyr Asn Thr Ile Ile Tyr Leu Thr Pro Asp Gly Ser Tyr
115 120 125

Lys Thr His Arg Lys Leu Val Pro Thr Tyr Thr Glu Arg Leu Val Trp
130 135 140

Gly Ala Gly Asp Gly Ser Gly Leu Asn Val Val Glu Thr Pro Tyr Gly
145 150 155 160

Ile Leu Gly Gly Leu Ile Cys Trp Glu His Trp Met Pro Leu Ala Arg
165 170 175

Ala Ala Met His Ser Lys Asn Glu Ala Ile His Val Cys Gln Phe Pro
180 185 190

Thr Val His Glu Arg His Gln Ile Ala Ser Arg His Tyr Ala Phe Glu
195 200 205

Gly Gln Cys Phe Val Leu Thr Ser Gly Cys Ala Met Thr Lys Thr Asp
210 215 220

Val Leu Glu Gly Phe Glu Ser Leu Glu Thr Asn Asp His Glu Val Phe
225 230 235 240

Gly Leu Leu Asp Ser Ile Glu Lys Glu Glu Leu Met Arg Gly Gly Ser
245 250 255

Ala Ile Ile Ala Pro Asp Leu Ser Tyr Ser Val Glu Pro Val Phe Asp
260 265 270

Glu Lys Thr Ile Val Tyr Gly Glu Leu Asn Leu Asp Leu Thr Lys Gln
275 280 285

Gly His Leu Phe Leu Asp Thr Asp Gly His Tyr Ser Arg Pro Asp Val
290 295 300

Phe Glu Leu Arg Val Asn Asp Lys Ala Asn Arg Asn Val Arg Phe Ala
305 310 315 320

Ser Glu Thr Val



<210> 231 
<211> 1062 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 231 

atgagtcaga tccatccaaa acttaaggtc gcggccgtgc aggctgcccc cgcctttctc 60
gatctcgatg catcgatcga aaagacaata cgctatgtcg acgaagcggc tgcggccggg 120
gcgaagttga ttgcgtttcc ggaaacctgg attcccggct acccatggtg gatctggctc 180
ggcgctccgg cctgggcgat catgcgtggc ttcgtctcgc gctatttcga caactcgctg 240
caatatggca gtccggaagc tgaacggctg cgggacgccg ccaggcgcaa caagatatac 300
atcgccctcg gcctgtccga gcgcgacggg ggcagtcttt atatcgcgca atggatcatc 360
gggcctggcg gcgaaacggt tgcacaacgc cgcaagctca agcccacgca cgccgagcgc 420
actgtattcg gcgaaggcga tggttcacat ctggccgtgc atgatctcga tattggaaga 480
ttgggcgcgc tttgttgctg ggaacatctg caaccgttgt cgaaatatgc aatgtacgcc 540
cagaacgagc aaattcacgt cgccgcctgg ccgagcttct cgctatacga tccctttgca 600
cacgcactcg gcgccgaggt caataacgct gcgagcaaga tctatgcggt cgagggatcg 660
tgcttcgtca ttgcgccgtg cgcaacggtt tcgcaggtga tgatcgatga gctctgcgat 720
acccccgaaa agcatcaatt ccttcacgtc ggcggcggct tcgccgtcat ttacggtccc 780
gacggctcgc cactggccaa acctcttccg ccagaccagg agggacttct ctatgccgac 840
atcgatctcg gcatgatctc ggtcgccaag gccgcagccg atcccgccgg acattatgca 900
cgtcccgatg taactcgcct gctgttcaac aatcgcccgg caaaccgcgt cgagaagctg 960
gcgttgccgg tcgatcagga ggccgaagtg gatagtccgc tgaaggctcc cgacgcatct 1020
cccaaagtga cggcgctcaa gccgtcgcag gctgcggagt ag 1062

<210> 232
<211> 353
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 232

Met Ser Gln Ile His Pro Lys Leu Lys Val Ala Ala Val Gln Ala Ala
1 5 10 15

Pro Ala Phe Leu Asp Leu Asp Ala Ser Ile Glu Lys Thr Ile Arg Tyr
20 25 30

Val Asp Glu Ala Ala Ala Ala Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45

Thr Trp Ile Pro Gly Tyr Pro Trp Trp Ile Trp Leu Gly Ala Pro Ala
50 55 60

Trp Ala Ile Met Arg Gly Phe Val Ser Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80

Gln Tyr Gly Ser Pro Glu Ala Glu Arg Leu Arg Asp Ala Ala Arg Arg
85 90 95

Asn Lys Ile Tyr Ile Ala Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110

Leu Tyr Ile Ala Gln Trp Ile Ile Gly Pro Gly Gly Glu Thr Val Ala
115 120 125

Gln Arg Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Val Phe Gly
130 135 140

Glu Gly Asp Gly Ser His Leu Ala Val His Asp Leu Asp Ile Gly Arg
145 150 155 160

Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr
165 170 175

Ala Met Tyr Ala Gln Asn Glu Gln Ile His Val Ala Ala Trp Pro Ser
180 185 190

Phe Ser Leu Tyr Asp Pro Phe Ala His Ala Leu Gly Ala Glu Val Asn
195 200 205

Asn Ala Ala Ser Lys Ile Tyr Ala Val Glu Gly Ser Cys Phe Val Ile
210 215 220

Ala Pro Cys Ala Thr Val Ser Gln Val Met Ile Asp Glu Leu Cys Asp
225 230 235 240

Thr Pro Glu Lys His Gln Phe Leu His Val Gly Gly Gly Phe Ala Val
245 250 255

Ile Tyr Gly Pro Asp Gly Ser Pro Leu Ala Lys Pro Leu Pro Pro Asp
260 265 270

Gln Glu Gly Leu Leu Tyr Ala Asp Ile Asp Leu Gly Met Ile Ser Val
275 280 285

Ala Lys Ala Ala Ala Asp Pro Ala Gly His Tyr Ala Arg Pro Asp Val
290 295 300

Thr Arg Leu Leu Phe Asn Asn Arg Pro Ala Asn Arg Val Glu Lys Leu
305 310 315 320

Ala Leu Pro Val Asp Gln Glu Ala Glu Val Asp Ser Pro Leu Lys Ala
325 330 335

Pro Asp Ala Ser Pro Lys Val Thr Ala Leu Lys Pro Ser Gln Ala Ala
340 345 350

Glu







<210> 233 
<211> 1002 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 233 

gtgagcacca tcgtcaaagc cgcggccgtg cagatcagcc ccgtgctgta cagccgcgag 60
ggcaccgtcg agcgggtcgt gaagaagatc cgggagctgg gcgaaaaggg cgtccagttc 120
gccaccttcc ccgagaccgt catcccttac tacccgtact tttccttcgt tcagacgccc 180
ttgcagatcc tcgccggccc cgagcatctg aagctgctcg accagtcggt gaccgtgccg 240
tcccccgcca cggacgcgat cggccaggcc gcccggcagg caggaatggt ggtgtccatc 300
ggcgtcaacg agcgtgacgg cggcaccctg tacaacacgc agctgctctt cgacgcggac 360
ggcgcgctga tccagcgtcg ccgcaagatc aagcccaccc actacgagcg catgatctgg 420
ggcgagggcg acggctccgg cctgcgcgcc gtcgacagcc aggtcggtcg tatcggccag 480
ctggcctgct gggagcacaa caaccccctg gcgcgctacg ccatgatggc cgacggcgag 540
cagatccatt cggccatgta tccgggctcg atgttcggcg acccgttcgc ccagaagacg 600
gaaatcaaca tccggcagca tgcgctggaa tccggatgct tcgtcgtctg ctcgacggcc 660
tggttggacg ccgatcagca ggcgcaaatc atgcaggaca cgggctgcgc catcggcccg 720
atctcgggcg gctgcctcac ggcgatcgtg gcgcccgacg gcacgttcct gggcgaaccg 780
ctcacgtcgg gcgagggcga ggtcatcgcc gacctcgatt tcaagctgat cgacaagcgc 840
aagcagacga tggactcgcg cggccactac aaccgccccg aactgctcag cctgctgatc 900
gatcgaacgc cgacgtcgaa cgtccatgag cgcgccgcgc acccgaaggt cgaggcgtca 960
caaacggctg gcgacacgga gcggacccgc gaggtcctgt aa 1002

<210> 234
<211> 333
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 234

Val Ser Thr Ile Val Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu
1 5 10 15

Tyr Ser Arg Glu Gly Thr Val Glu Arg Val Val Lys Lys Ile Arg Glu
20 25 30

Leu Gly Glu Lys Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Ile
35 40 45

Pro Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Thr Pro Leu Gln Ile Leu
50 55 60

Ala Gly Pro Glu His Leu Lys Leu Leu Asp Gln Ser Val Thr Val Pro
65 70 75 80

Ser Pro Ala Thr Asp Ala Ile Gly Gln Ala Ala Arg Gln Ala Gly Met
85 90 95

Val Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn
100 105 110

Thr Gln Leu Leu Phe Asp Ala Asp Gly Ala Leu Ile Gln Arg Arg Arg
115 120 125

Lys Ile Lys Pro Thr His Tyr Glu Arg Met Ile Trp Gly Glu Gly Asp
130 135 140

Gly Ser Gly Leu Arg Ala Val Asp Ser Gln Val Gly Arg Ile Gly Gln
145 150 155 160

Leu Ala Cys Trp Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met Met
165 170 175

Ala Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Met Phe
180 185 190

Gly Asp Pro Phe Ala Gln Lys Thr Glu Ile Asn Ile Arg Gln His Ala
195 200 205

Leu Glu Ser Gly Cys Phe Val Val Cys Ser Thr Ala Trp Leu Asp Ala
210 215 220

Asp Gln Gln Ala Gln Ile Met Gln Asp Thr Gly Cys Ala Ile Gly Pro
225 230 235 240

Ile Ser Gly Gly Cys Leu Thr Ala Ile Val Ala Pro Asp Gly Thr Phe
245 250 255

Leu Gly Glu Pro Leu Thr Ser Gly Glu Gly Glu Val Ile Ala Asp Leu
260 265 270

Asp Phe Lys Leu Ile Asp Lys Arg Lys Gln Thr Met Asp Ser Arg Gly
275 280 285

His Tyr Asn Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro
290 295 300

Thr Ser Asn Val His Glu Arg Ala Ala His Pro Lys Val Glu Ala Ser
305 310 315 320

Gln Thr Ala Gly Asp Thr Glu Arg Thr Arg Glu Val Leu
325 330

<210> 235 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 235 

atgaaaattg ttaaagcggc agcagttcag ctgagccccg tcctctatag ccgcgagggg 60
acggtcgaaa gagtagtgcg gaagattcat caacttggtc aacagggagt gcagtttgcc 120
accttcccgg aaacagtggt gccttactac ccgtattttt cgatcgtgca gtccgggtat 180
caaatccttc gcggcggtga gttcgtgaag ctgctcgatc agtcagtgac ggtgccatct 240
cttgccaccg aagcgatcgc cgaggcctgc aggcaggcgg gcgtcgttgt ctccatcggc 300
gtcaatgagc gtgacggcgg aactatatac aatgcgcagc ttctgtttga ttcggacggc 360
acattgattc agaggcgacg caagatcacg cccacccact acgagcgcat gatctggggc 420
cagggcgatg gctcgggtct gcgggctgtg gacagcaagg tggcacgtat tggtcaactg 480
gcgtgctttg agcattacaa ccccctcgca cgatacgcga tgatcgccga tggcgagcag 540
atccactctg caatgtttcc cggttccatg ttcggcgatg gtttcgcgga gaggaccgag 600
atcgcggtca ggcagcatgc gcaggagtcc ggatgctttg tagtttgtgc tacggcgtgg 660
ctggatgccg accagcaggc tcaaattgcc gcggacacag gcatcaccga cctgggaccg 720
atctccggcg gttgcttcac tgcgatcatt gcacctgatg ggagcctgct gggtcaacca 780
atccgctcgg gcgaaggtga cgtcattgtc gatctcgatt tcactctgat cgacaggcgg 840
aagcatgttg tggactcgag aggtcactac agccggccgg aattgctaag cctgctgatc 900
gaccgtactc ccacagcgca cgttcacgaa cgggccgcgc actctcactt ggccgccgag 960
caatgcttgg aggatcttaa cgcgcttgct taa 993

<210> 236
<211> 330
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 236

Met Lys Ile Val Lys Ala Ala Ala Val Gln Leu Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Glu Gly Thr Val Glu Arg Val Val Arg Lys Ile His Gln Leu
20 25 30

Gly Gln Gln Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Ile Val Gln Ser Gly Tyr Gln Ile Leu Arg
50 55 60

Gly Gly Glu Phe Val Lys Leu Leu Asp Gln Ser Val Thr Val Pro Ser
65 70 75 80

Leu Ala Thr Glu Ala Ile Ala Glu Ala Cys Arg Gln Ala Gly Val Val
85 90 95

Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Ala
100 105 110

Gln Leu Leu Phe Asp Ser Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125

Ile Thr Pro Thr His Tyr Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140

Ser Gly Leu Arg Ala Val Asp Ser Lys Val Ala Arg Ile Gly Gln Leu
145 150 155 160

Ala Cys Phe Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Met Ile Ala
165 170 175

Asp Gly Glu Gln Ile His Ser Ala Met Phe Pro Gly Ser Met Phe Gly
180 185 190

Asp Gly Phe Ala Glu Arg Thr Glu Ile Ala Val Arg Gln His Ala Gln
195 200 205

Glu Ser Gly Cys Phe Val Val Cys Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220

Gln Gln Ala Gln Ile Ala Ala Asp Thr Gly Ile Thr Asp Leu Gly Pro
225 230 235 240

Ile Ser Gly Gly Cys Phe Thr Ala Ile Ile Ala Pro Asp Gly Ser Leu
245 250 255

Leu Gly Gln Pro Ile Arg Ser Gly Glu Gly Asp Val Ile Val Asp Leu
260 265 270

Asp Phe Thr Leu Ile Asp Arg Arg Lys His Val Val Asp Ser Arg Gly
275 280 285

His Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro
290 295 300

Thr Ala His Val His Glu Arg Ala Ala His Ser His Leu Ala Ala Glu
305 310 315 320

Gln Cys Leu Glu Asp Leu Asn Ala Leu Ala
325 330

<210> 237 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 237 

atgaaagtcg tcaaagccgc cgcagtgcag atcagtcctg ttctctacag ccgagaggca 60
accgtcgcca aggtcgtgca gaagatccac gaactcggcc agaaaggcgt gcagttcgcc 120
acctttccag agaccgtagt gccttactac ccgtactttt ccgccgtcca gacaggcatc 180
gagcttctgt ccggcacgga gcatctccgg ctgctcgatc aggccgtgac ggtgccgtct 240
gccgctaccg acgcgatcgg agaggcagcc cggaaggcag gcatggtggt gtcgatcggc 300
gtcaatgaac gcgatggcgg caccttgtac aacacacagt tgctcttcga tgccgatggc 360
accttgatcc agcgccgccg caagatcacg ccgacccact tcgaacgcat gatctggggc 420
cagggggacg gttcgggcct gcgcgctgtc gacagcaagg tcggtcgcat tggccagttg 480
gcctgcttcg agcacaacaa cccgctggcc cgctacgcgt tgattgccga cggcgagcag 540
atccattccg ccatgtatcc gggttctgct tttggcgaag gatttgccca aaggatggaa 600
atcaatatcc gccagcatgc gctggagtcg ggtgcgttcg tcgtcaacgc aacggcctgg 660
ctggatgctg accagcaggc gcagatcatg aaggacaccg gctgcgggat tggcccgatc 720
tcgggcggtt gcttcaccac gatcgtgtca cccgacggca tgctgatggc cgaacccctg 780
cgctcgggcg agggtgaggt catcgtcgat ctcgacttca cgctgatcga ccgtcgcaag 840
atgttgatgg actcggcggg ccactataac cgcccggaac tgctcagtct catgatcgac 900
cgcaccccga ccgcgcacgt tcatgaacgc gctgcgcgtc cggtgtcggg cgttgagcag 960
aacccggagg aacttcgcat cccggccgcg tga 993



<210> 238 
<211> 330 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample



<400> 238 

Met Lys Val Val Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Glu Ala Thr Val Ala Lys Val Val Gln Lys Ile His Glu Leu
20 25 30

Gly Gln Lys Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Ala Val Gln Thr Gly Ile Glu Leu Leu Ser
50 55 60

Gly Thr Glu His Leu Arg Leu Leu Asp Gln Ala Val Thr Val Pro Ser
65 70 75 80

Ala Ala Thr Asp Ala Ile Gly Glu Ala Ala Arg Lys Ala Gly Met Val
85 90 95

Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr
100 105 110

Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125

Ile Thr Pro Thr His Phe Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140

Ser Gly Leu Arg Ala Val Asp Ser Lys Val Gly Arg Ile Gly Gln Leu
145 150 155 160

Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Leu Ile Ala
165 170 175

Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Ala Phe Gly
180 185 190

Glu Gly Phe Ala Gln Arg Met Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205

Glu Ser Gly Ala Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220

Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Gly Ile Gly Pro Ile
225 230 235 240

Ser Gly Gly Cys Phe Thr Thr Ile Val Ser Pro Asp Gly Met Leu Met
245 250 255

Ala Glu Pro Leu Arg Ser Gly Glu Gly Glu Val Ile Val Asp Leu Asp
260 265 270

Phe Thr Leu Ile Asp Arg Arg Lys Met Leu Met Asp Ser Ala Gly His
275 280 285

Tyr Asn Arg Pro Glu Leu Leu Ser Leu Met Ile Asp Arg Thr Pro Thr
290 295 300

Ala His Val His Glu Arg Ala Ala Arg Pro Val Ser Gly Val Glu Gln
305 310 315 320

Asn Pro Glu Glu Leu Arg Ile Pro Ala Ala
325 330



<210> 239 
<211> 969 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 239 

atgtccaatg agaataagta tgacacattt aaagttgctg cagtccaggc cacacctgtg 60
tttcttgatc gtgaagcaac catcgacaaa gcttgcgagt tgattgctac tgcaggtcgt 120
gaaggtgctc gcctgattgt ctttccagaa gcgttcatcc catcctatcc cgagtgggta 180
tggggtattc cctctggtga gcaaggttta ctcaacgaac tttattcaga gttgctcacc 240
aatgcggtta ccatacccag cgacgcgact gacagattgt gcgaggcggc aaagcttgcg 300
aatgcctatg tagtgatggg aatgagcgaa cggaatgtcg aagcgagtgg tgcaagcctg 360
tataacacga tgttgtatat agatgcacag ggggagattt tagggaaaca tcggaagctg 420
gtgccaacgg gtggtgaacg cctggtatgg gcgcaaggtg atggcagcac gctgcaggtc 480
tacgatactc cattgggaaa acttggtggg ttaatttgct gggaaaatta tatgccgctg 540
gcacgctata cgatgtatgc ctggggaaca caaatctatg ttgcagcaac gtgggattgc 600
ggccaaccct ggctctcaac gatacggcat attgctaaag aaggcagggt atacgtggtt 660
ggttgctgta tcgcgatgcg taaagatgat attccagatc gttactctat gaagcagaaa 720
tattatgctg aaatggatga atggataaat gttggggata gcgcgattgt caatcccgaa 780
ggacatttta ttgcagggcc tgtgcgcaag caagaagaaa ttctctatgc ggagattgat 840
ccacgtatga tgcaaggccc gaagtggatg cttgacgtgg cgggacatta tgcaagacca 900
gatgtgttcc agttgacggt gcatacggat gtgaggcaga tgatacgggt ggaagatgat 960
tctcaatga 969



<210> 240 
<211> 322 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 240 

Met Ser Asn Glu Asn Lys Tyr Asp Thr Phe Lys Val Ala Ala Val Gln
1 5 10 15

Ala Thr Pro Val Phe Leu Asp Arg Glu Ala Thr Ile Asp Lys Ala Cys
20 25 30

Glu Leu Ile Ala Thr Ala Gly Arg Glu Gly Ala Arg Leu Ile Val Phe
35 40 45

Pro Glu Ala Phe Ile Pro Ser Tyr Pro Glu Trp Val Trp Gly Ile Pro
50 55 60

Ser Gly Glu Gln Gly Leu Leu Asn Glu Leu Tyr Ser Glu Leu Leu Thr
65 70 75 80

Asn Ala Val Thr Ile Pro Ser Asp Ala Thr Asp Arg Leu Cys Glu Ala
85 90 95

Ala Lys Leu Ala Asn Ala Tyr Val Val Met Gly Met Ser Glu Arg Asn
100 105 110

Val Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Met Leu Tyr Ile Asp
115 120 125

Ala Gln Gly Glu Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly
130 135 140

Gly Glu Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln Val
145 150 155 160

Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn
165 170 175

Tyr Met Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gln Ile
180 185 190

Tyr Val Ala Ala Thr Trp Asp Cys Gly Gln Pro Trp Leu Ser Thr Ile
195 200 205

Arg His Ile Ala Lys Glu Gly Arg Val Tyr Val Val Gly Cys Cys Ile
210 215 220

Ala Met Arg Lys Asp Asp Ile Pro Asp Arg Tyr Ser Met Lys Gln Lys
225 230 235 240

Tyr Tyr Ala Glu Met Asp Glu Trp Ile Asn Val Gly Asp Ser Ala Ile
245 250 255

Val Asn Pro Glu Gly His Phe Ile Ala Gly Pro Val Arg Lys Gln Glu
260 265 270

Glu Ile Leu Tyr Ala Glu Ile Asp Pro Arg Met Met Gln Gly Pro Lys
275 280 285

Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gln
290 295 300

Leu Thr Val His Thr Asp Val Arg Gln Met Ile Arg Val Glu Asp Asp
305 310 315 320

Ser Gln



<210> 241 
<211> 972 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 241 

atgatatcca acgagcataa caatactcca ttcaaggttg ctgctgtgca ggctacacct 60
gtgtttcttg atcgcgaagc aacgatcgat aaagcgtgtg aactgatcgc tactgccggt 120
catgaaggcg ctcgtttgat tgtttttcca gaagcgttca tcccatccta tcccgagtgg 180
gtatggggaa ttccctctgg cgagcaaggt ttgctcaacg atctgtatgc agagttactc 240
accaattcag ttacgatacc cagcaacgca actgacaggc tttgtagagc cgcgaagctt 300
gctaatgcct acgtggtgat ggggatgagc gaacggaata tcgaagcgag cggcgcaagc 360
ctgtacaata cgatgttata tatagatgca cagggtgaga ttttgggcaa acatcgaaag 420
ctggtgccaa cgggcggaga acgcctggta tgggcacaag gagatggaag cacgctgcag 480
gtttacgata cacctctagg aaagcttggt ggtttaattt gctgggaaaa ttatatgccg 540
ctggcacgct acgctatgta tgcctgggga actcaaatct acgtcgcggc aacgtgggat 600
cgcggccaac cctggctctc aacgatacgg catatcgcta aagagggcag ggtatacgta 660
atcggttgct gtatcgcgat gcgtaaagac gatattccag ataggtactc catgaagcag 720
aagtattatg cggagatgga tgaatggatc aacgtaggtg acagcgcgat tgtcaatcct 780
gagggggact tcattgcggg gcctgtgagc aagcaggagg aaattctcta tgcggagatt 840
gatccgcgga tggtgcaagg tccgaagtgg atgctggatg tggcggggca ttacgcgagg 900
cctgatgtgt tcgagttgac ggtgcatacg gatgtgaggg agatgatgcg ggtggagcat 960
gattatcaat ga 972

<210> 242 
<211> 323 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 242

Met Ile Ser Asn Glu His Asn Asn Thr Pro Phe Lys Val Ala Ala Val

1 5 10 15

Gln Ala Thr Pro Val Phe Leu Asp Arg Glu Ala Thr Ile Asp Lys Ala

20 25 30

Cys Glu Leu Ile Ala Thr Ala Gly His Glu Gly Ala Arg Leu Ile Val

35 40 45

Phe Pro Glu Ala Phe Ile Pro Ser Tyr Pro Glu Trp Val Trp Gly Ile

50 55 60

Pro Ser Gly Glu Gln Gly Leu Leu Asn Asp Leu Tyr Ala Glu Leu Leu
65 70 75 80

Thr Asn Ser Val Thr Ile Pro Ser Asn Ala Thr Asp Arg Leu Cys Arg

85 90 95

Ala Ala Lys Leu Ala Asn Ala Tyr Val Val Met Gly Met Ser Glu Arg

100 105 110

Asn Ile Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Met Leu Tyr Ile

115 120 125

Asp Ala Gln Gly Glu Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr

130 135 140

Gly Gly Glu Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln
145 150 155 160

Val Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu

165 170 175

Asn Tyr Met Pro Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Thr Gln

180 185 190

Ile Tyr Val Ala Ala Thr Trp Asp Arg Gly Gln Pro Trp Leu Ser Thr

195 200 205

Ile Arg His Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys

210 215 220

Ile Ala Met Arg Lys Asp Asp Ile Pro Asp Arg Tyr Ser Met Lys Gln
225 230 235 240

Lys Tyr Tyr Ala Glu Met Asp Glu Trp Ile Asn Val Gly Asp Ser Ala

245 250 255

Ile Val Asn Pro Glu Gly Asp Phe Ile Ala Gly Pro Val Ser Lys Gln

260 265 270

Glu Glu Ile Leu Tyr Ala Glu Ile Asp Pro Arg Met Val Gln Gly Pro

275 280 285

Lys Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe

290 295 300

Glu Leu Thr Val His Thr Asp Val Arg Glu Met Met Arg Val Glu His
305 310 315 320

Asp Tyr Gln






<210> 243 
<211> 999 
<212> DNA 
<213> Unknown 
<220>

<223> Obtained from an environmental sample 



<400> 243 

atgaagcaaa ctcgagtagc gatcattcag gcggaacctg tatatctcaa tttgcaagcg 60
agtgttgcga gggctatcga tcttgccgga cgcgccgcga agcaaggagc gcgtctgata 120
gtgtttggag agacctggtt gccgggttat cccgcgtggc tggattactg tcccggcatg 180
gcgttctggg atcaccggcc gacaaaagaa gtgtttgccc ggacccgcga gaacagtgtt 240
gtaattccgg gaaaggaaat cgaacagctc tgtaaaactg cggcggagct gggagttgta 300
atttcgatcg gtgtaaacga aaaaattctg gaaggccccg gaaacggcac gctctacaat 360
tctcttttgc tgattgatga atcaggaaaa ctggccggcc atcaccgcaa actggttccg 420
acttatacgg aacggatggt gtggggaatg ggtgatggag gtggaatgga agccatatcg 480
actgcagccg gcagggttgg cggattgatt tgctgggagc actggatgcc attgagccgg 540
caggtgctgc acatgtcggg tgaggaaatt catgtggcag tgtggcccac ggttcatgag 600
gtgcaccagc ttgcatcacg ccattatgca tttgaagggc gttgttttgt gctcgcagcc 660
ggattgttga tgaaggtccg ggatattcct ccggagctgg aattgccttc tcagatgtcg 720
cgtgaatccg aagactggct tctgcgcggc gggagcgccg tcattggtcc ggatggaaag 780
tacattgtgg agccgttgtt tgatcgagag gcgattctca cagccgatct tgaattagcc 840
gcatgcgatc gtgaaaaaat gacgctggac gtaacgggac attattcccg ccccgatctt 900
tttcacctgg aattcaggaa acagcaatcc ggccatattg cgggagcagg aacgatcagc 960
cggcaaaaat cagcgccgga ccgcgcggac gatcactaa 999



<210> 244 
<211> 332 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 






<400> 244 

Met Lys Gln Thr Arg Val Ala Ile Ile Gln Ala Glu Pro Val Tyr Leu
1 5 10 15

Asn Leu Gln Ala Ser Val Ala Arg Ala Ile Asp Leu Ala Gly Arg Ala
20 25 30

Ala Lys Gln Gly Ala Arg Leu Ile Val Phe Gly Glu Thr Trp Leu Pro
35 40 45

Gly Tyr Pro Ala Trp Leu Asp Tyr Cys Pro Gly Met Ala Phe Trp Asp
50 55 60

His Arg Pro Thr Lys Glu Val Phe Ala Arg Thr Arg Glu Asn Ser Val
65 70 75 80

Val Ile Pro Gly Lys Glu Ile Glu Gln Leu Cys Lys Thr Ala Ala Glu
85 90 95

Leu Gly Val Val Ile Ser Ile Gly Val Asn Glu Lys Ile Leu Glu Gly
100 105 110

Pro Gly Asn Gly Thr Leu Tyr Asn Ser Leu Leu Leu Ile Asp Glu Ser
115 120 125

Gly Lys Leu Ala Gly His His Arg Lys Leu Val Pro Thr Tyr Thr Glu
130 135 140

Arg Met Val Trp Gly Met Gly Asp Gly Gly Gly Met Glu Ala Ile Ser
145 150 155 160

Thr Ala Ala Gly Arg Val Gly Gly Leu Ile Cys Trp Glu His Trp Met
165 170 175

Pro Leu Ser Arg Gln Val Leu His Met Ser Gly Glu Glu Ile His Val
180 185 190

Ala Val Trp Pro Thr Val His Glu Val His Gln Leu Ala Ser Arg His
195 200 205

Tyr Ala Phe Glu Gly Arg Cys Phe Val Leu Ala Ala Gly Leu Leu Met
210 215 220

Lys Val Arg Asp Ile Pro Pro Glu Leu Glu Leu Pro Ser Gln Met Ser
225 230 235 240

Arg Glu Ser Glu Asp Trp Leu Leu Arg Gly Gly Ser Ala Val Ile Gly
245 250 255

Pro Asp Gly Lys Tyr Ile Val Glu Pro Leu Phe Asp Arg Glu Ala Ile
260 265 270

Leu Thr Ala Asp Leu Glu Leu Ala Ala Cys Asp Arg Glu Lys Met Thr
275 280 285

Leu Asp Val Thr Gly His Tyr Ser Arg Pro Asp Leu Phe His Leu Glu
290 295 300

Phe Arg Lys Gln Gln Ser Gly His Ile Ala Gly Ala Gly Thr Ile Ser
305 310 315 320

Arg Gln Lys Ser Ala Pro Asp Arg Ala Asp Asp His
325 330















<210> 245 
<211> 999 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 245 

atgggtgaga atgccaactt caccgtcgca gctgtccagg caacgccggt cttcttagac 60
cgggatgcga cggtcgagaa ggcttgcgag ctcatcgctg aagccgggcg aaacggagcg 120
cgcctggccg tttttcccga ggcgtttgtg ccggcttacc cggactgggt ctgggctgtt 180
ccgccgggcg attcaaggct gctgcacgag ctctacggtg agctgatcca gaactctgtc 240
acgattccca gcgagtcgac ggagaaactc tgccgggccg cccgcggggc caaagtctgc 300
gtggcgatcg gcatcaacga gaggaatgcg gaggcaagcg ggggtagcct ctacaacagc 360
ctcctgtaca tcagcccgga cggccaggtc ctcgggaagc accgcaagct cgttcccacc 420
ggagcggagc ggcttgtctg ggcgcagggc gacggcagca ctatcgacgt gtttgagttg 480
cctttctgtc gtttgggtgg cctcatctgt tgggagaact acatgccgct ggcccgttat 540
gcgatgtacg cctggggcac gcaggtctac gtcgcggcaa cgtgggacca cggcgaacct 600
tggctctcaa ccttgaggca tatcgccagg gaggggcgtg catatgtcat tggcgtttgc 660
atgccgatgc gcatgagcga catcccggac cgatacgagt tcaagcgcaa gtactatggc 720
gggcgcgact ggatcaatac tggtgacagc gccatcgtgg gtccggacgg aaacttcatc 780
gccggccccc tgagcgagcg cgaagagatc ctgtacgccg atatagacct gaatcggctt 840
gcgaactcga agtggatgct ggacgtcgcc gggcactatg cacggccgga cgtcttccag 900
ttgaccgtta accgcgagcc gaacccgatg atctctgagg atgggcacaa gacggttccc 960
acgctaccga aacgtgcggg gaagagtagg acgagatga 999



<210> 246 
<211> 332 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 246 












Met Gly Glu Asn Ala Asn Phe Thr Val Ala Ala Val Gln Ala Thr Pro
1 5 10 15

Val Phe Leu Asp Arg Asp Ala Thr Val Glu Lys Ala Cys Glu Leu Ile
20 25 30

Ala Glu Ala Gly Arg Asn Gly Ala Arg Leu Ala Val Phe Pro Glu Ala
35 40 45

Phe Val Pro Ala Tyr Pro Asp Trp Val Trp Ala Val Pro Pro Gly Asp
50 55 60

Ser Arg Leu Leu His Glu Leu Tyr Gly Glu Leu Ile Gln Asn Ser Val
65 70 75 80

Thr Ile Pro Ser Glu Ser Thr Glu Lys Leu Cys Arg Ala Ala Arg Gly
85 90 95

Ala Lys Val Cys Val Ala Ile Gly Ile Asn Glu Arg Asn Ala Glu Ala
100 105 110

Ser Gly Gly Ser Leu Tyr Asn Ser Leu Leu Tyr Ile Ser Pro Asp Gly
115 120 125

Gln Val Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Ala Glu Arg
130 135 140

Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Ile Asp Val Phe Glu Leu
145 150 155 160

Pro Phe Cys Arg Leu Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met Pro
165 170 175

Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Thr Gln Val Tyr Val Ala
180 185 190

Ala Thr Trp Asp His Gly Glu Pro Trp Leu Ser Thr Leu Arg His Ile
195 200 205

Ala Arg Glu Gly Arg Ala Tyr Val Ile Gly Val Cys Met Pro Met Arg
210 215 220

Met Ser Asp Ile Pro Asp Arg Tyr Glu Phe Lys Arg Lys Tyr Tyr Gly
225 230 235 240

Gly Arg Asp Trp Ile Asn Thr Gly Asp Ser Ala Ile Val Gly Pro Asp
245 250 255

Gly Asn Phe Ile Ala Gly Pro Leu Ser Glu Arg Glu Glu Ile Leu Tyr
260 265 270

Ala Asp Ile Asp Leu Asn Arg Leu Ala Asn Ser Lys Trp Met Leu Asp
275 280 285

Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gln Leu Thr Val Asn
290 295 300

Arg Glu Pro Asn Pro Met Ile Ser Glu Asp Gly His Lys Thr Val Pro
305 310 315 320

Thr Leu Pro Lys Arg Ala Gly Lys Ser Arg Thr Arg
325 330



<210> 247 
<211> 990 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 247 

atgccgaccc ccaaggaaaa gtttagaatc gccgccgttc aggcttgccc cgtttttctg 60
gaccgggggg agacggtaaa gaaggcctgc cggctggcgg ccgaggccgg gggccagggc 120
gcccggctga tcgtgtttcc ggagtccttc atccccgctt acccggactg ggtgtgggcc 180
gttccgccgg ggagggagaa gctcctgaat gaaatgtacg ccgaattcct ggccggcgcg 240
gtggaagtcc cggggccggt gacggaggaa ttgggccggg cggcggaaag ggccggcgct 300
tacctggtca tgggggtcac cgagcgggac accgaggcca gcggggcaag cctgtacaac 360
accctcctct atttcggtcc ccagggaagc ctgctgggaa aacaccgcaa actggtgccc 420
acgggagggg aacggaccgt ctgggcccgg ggggacggca gcacgctgca ggtgtacgat 480
acccccctgg gaaagatcgg cggcctgatc tgctgggaga actacatgcc cctggcccgc 540
tacgccatgt atgcgtgggg cactcagatt tacctggccc ccacctggga ccggggggaa 600
ccctggcttt caaccctgcg gcacatcgcc aaggaagggc gggtttacgt ggtggggtgc 660
tgcatggcta tgcaaaaagg ggacatcccg gatcgcttcg aatacaagca aaaatactat 720
cccgcagccc gggagtggat caacacgggc gacagcgcca tcctgaaccc ggagggggaa 780
ttcatcgccg ggccggcggg aaagaaggaa gagatcctgt acgctgaaat agacccccgg 840
cagatggggg ggcccaagtg gatgctggac gtggccggcc attacgcccg gccggatgtc 900
ttcgaactga tcgttcaccg ggaggcccgg ccgatgatcc gggtgacgga agcgccgtct 960
ccgggagaga aagaaacggg tgaaggatag 990

<210> 248
<211> 329
<212> PRT
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 248 

Met Pro Thr Pro Lys Glu Lys Phe Arg Ile Ala Ala Val Gln Ala Cys

1 5 10 15

Pro Val Phe Leu Asp Arg Gly Glu Thr Val Lys Lys Ala Cys Arg Leu

20 25 30

Ala Ala Glu Ala Gly Gly Gln Gly Ala Arg Leu Ile Val Phe Pro Glu

35 40 45

Ser Phe Ile Pro Ala Tyr Pro Asp Trp Val Trp Ala Val Pro Pro Gly

50 55 60

Arg Glu Lys Leu Leu Asn Glu Met Tyr Ala Glu Phe Leu Ala Gly Ala
65 70 75 80

Val Glu Val Pro Gly Pro Val Thr Glu Glu Leu Gly Arg Ala Ala Glu

85 90 95

Arg Ala Gly Ala Tyr Leu Val Met Gly Val Thr Glu Arg Asp Thr Glu

100 105 110

Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Phe Gly Pro Gln

115 120 125

Gly Ser Leu Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu

130 135 140

Arg Thr Val Trp Ala Arg Gly Asp Gly Ser Thr Leu Gln Val Tyr Asp
145 150 155 160

Thr Pro Leu Gly Lys Ile Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met

165 170 175

Pro Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Thr Gln Ile Tyr Leu

180 185 190

Ala Pro Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His

195 200 205

Ile Ala Lys Glu Gly Arg Val Tyr Val Val Gly Cys Cys Met Ala Met

210 215 220

Gln Lys Gly Asp Ile Pro Asp Arg Phe Glu Tyr Lys Gln Lys Tyr Tyr
225 230 235 240

Pro Ala Ala Arg Glu Trp Ile Asn Thr Gly Asp Ser Ala Ile Leu Asn

245 250 255

Pro Glu Gly Glu Phe Ile Ala Gly Pro Ala Gly Lys Lys Glu Glu Ile

260 265 270

Leu Tyr Ala Glu Ile Asp Pro Arg Gln Met Gly Gly Pro Lys Trp Met

275 280 285

Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu Leu Ile

290 295 300

Val His Arg Glu Ala Arg Pro Met Ile Arg Val Thr Glu Ala Pro Ser
305 310 315 320

Pro Gly Glu Lys Glu Thr Gly Glu Gly
325

<210> 249 
<211> 1017 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 249 

atgggcattc aacatccgaa atatcgcgtc gcggtggtgc aggcggcacc ggcctggctc 60
gacctcgagg cgtcggtcag caagagcatc gcgctgatag aggaggccgc cgccaagggc 120
gccaagctga tcgcgttccc cgaggccttc atccccggct atccctggta catctggctg 180
gactcgccgg cctgggcgat cggccgcggc ttcgtgcagc gctatttcga caattcgctc 240
agctatgaca gcccgcaggc ggagcgcctg aggctcgcag tgaagaaggc cggcatgacc 300
gcagtgctcg gcctgtccga gcgcgacggc ggcagcctct atctcgcgca atggttgatc 360
ggacccgacg gcgagaccat cgcaaagcgg cgcaagctgc ggccgaccca tgccgagcgc 420
accgtctacg gcgagggcga cggcagcgac cttgcggtgc atgaccgccc cggcatcggc 480
cggctcggtg cgctgtgctg ctgggagcat ctgcagccgc tgtcgaaata cgcgatgtac 540
gcccagaacg agcaggtgca tgtcgcggcc tggccgagct tctcgctgta cgatccgttc 600
gcgccggcgc tcggctggga ggtcaacaat gcggcctcgc gcgtctatgc cgtcgagggc 660
tcctgcttcg tgctggcgcc ctgcgccacc gtctcgcagg cgatgatcga cgagctctgc 720
gaccgcgacg acaagcatgc gctgctgcat gttggcggcg gccatgccgc gatcttcggc 780
cccgacggca gcgcgatcgc ggacaagctt ccgtccgacc aggagggcct cctgttcgcc 840
gacatcgatc tcggcgcgat cgggatcgcg aagaatgccg ctgatccggc cgggcactat 900
tcgcgcccgg acgtgacgcg gctgctgctc aacaagaagc cctcgaagcg cgtcgagcac 960
ttcgcgctgc cgctcgacac gctcgcgggc gaggagatcg acgcggccgc aagctaa 1017



<210> 250 
<211> 338 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample

<400> 250

Met Gly Ile Gln His Pro Lys Tyr Arg Val Ala Val Val Gln Ala Ala
1 5 10 15

Pro Ala Trp Leu Asp Leu Glu Ala Ser Val Ser Lys Ser Ile Ala Leu
20 25 30

Ile Glu Glu Ala Ala Ala Lys Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45

Ala Phe Ile Pro Gly Tyr Pro Trp Tyr Ile Trp Leu Asp Ser Pro Ala
50 55 60

Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80

Ser Tyr Asp Ser Pro Gln Ala Glu Arg Leu Arg Leu Ala Val Lys Lys
85 90 95

Ala Gly Met Thr Ala Val Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110

Leu Tyr Leu Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125

Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly
130 135 140

Glu Gly Asp Gly Ser Asp Leu Ala Val His Asp Arg Pro Gly Ile Gly
145 150 155 160

Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175

Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
180 185 190

Ser Phe Ser Leu Tyr Asp Pro Phe Ala Pro Ala Leu Gly Trp Glu Val
195 200 205

Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220

Leu Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225 230 235 240

Asp Arg Asp Asp Lys His Ala Leu Leu His Val Gly Gly Gly His Ala
245 250 255

Ala Ile Phe Gly Pro Asp Gly Ser Ala Ile Ala Asp Lys Leu Pro Ser
260 265 270

Asp Gln Glu Gly Leu Leu Phe Ala Asp Ile Asp Leu Gly Ala Ile Gly
275 280 285

Ile Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300

Val Thr Arg Leu Leu Leu Asn Lys Lys Pro Ser Lys Arg Val Glu His
305 310 315 320

Phe Ala Leu Pro Leu Asp Thr Leu Ala Gly Glu Glu Ile Asp Ala Ala
325 330 335

Ala Ser



<210> 251 
<211> 978 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 251 

gtgaccatcg tgagagctgc cgccgtgcag atcagtcccg tgctctacag ccgggaagcc 60
accgtagaaa aagtcgttcg caagatccgc gaactgggaa gaaacggcgt gcagttcgcc 120
accttcccgg aaaccctggt gccctactac ccgtacttcg cggccgtgca gacgggcatc 180
gaactgctgt ccggcaagga gcacctgcga ctgctggaac aatccgtaac ggttccctcg 240
cccgccaccg atgccattgc ccaggcggca cgcgaagccg gcatggtggt gtccatcggt 300
gtcaacgagc gcgacggagg caccatctac aacacgcagc tgctgtttga cgccgacggc 360
acgctggtac agcgccgccg caagatcacg ccgacgcatt tcgagcgcat ggtctggggc 420
cagggcgacg gctcgggcct gcgagccgtg gacaccaagg ccggccgcat cggtcagctc 480
gcctgcttcg agcacaacaa cccgctggcg cgctacgcca tgatcgccga cggtgagcag 540
atccattcgg ccatgtaccc gggctctgcc ttcggcgagg gcttcgcgca gcgcatggaa 600
atcaacatac gccagcacgc cctggagtct ggctgcttcg tggtgaatgc gaccgcgtgg 660
ctggatgccg accagcaggc gcagatcatg aaggatacgg gctgcggcat cggcccgatc 720
tccggcggct gcttcacgac catcgtcacg ccggacggca tgctgatcgg tgaacccctc 780
cgcgaaggcg aaggcgaagt catcgccgac ctcgatttca ccctgatcga ccggcgcaag 840
ctgctggtgg actcggtggg ccactacaac cgtccggagc tgctgagcct gctgatcgat 900
cgcacccctg cggcgaactt ccatgagcgc aatgcgcttc cgtccgtcaa caccgccagc 960
agcctcgaaa tcgtctga 978



<210> 252 
<211> 325 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 252 



Val Thr Ile Val Arg Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Glu Ala Thr Val Glu Lys Val Val Arg Lys Ile Arg Glu Leu
20 25 30

Gly Arg Asn Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Leu Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ala Ala Val Gln Thr Gly Ile Glu Leu Leu Ser
50 55 60

Gly Lys Glu His Leu Arg Leu Leu Glu Gln Ser Val Thr Val Pro Ser
65 70 75 80

Pro Ala Thr Asp Ala Ile Ala Gln Ala Ala Arg Glu Ala Gly Met Val
85 90 95

Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Thr
100 105 110

Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Val Gln Arg Arg Arg Lys
115 120 125

Ile Thr Pro Thr His Phe Glu Arg Met Val Trp Gly Gln Gly Asp Gly
130 135 140

Ser Gly Leu Arg Ala Val Asp Thr Lys Ala Gly Arg Ile Gly Gln Leu
145 150 155 160

Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met Ile Ala
165 170 175

Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Ala Phe Gly
180 185 190

Glu Gly Phe Ala Gln Arg Met Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205

Glu Ser Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220

Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Gly Ile Gly Pro Ile
225 230 235 240

Ser Gly Gly Cys Phe Thr Thr Ile Val Thr Pro Asp Gly Met Leu Ile
245 250 255

Gly Glu Pro Leu Arg Glu Gly Glu Gly Glu Val Ile Ala Asp Leu Asp
260 265 270

Phe Thr Leu Ile Asp Arg Arg Lys Leu Leu Val Asp Ser Val Gly His
275 280 285

Tyr Asn Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Ala
290 295 300

Ala Asn Phe His Glu Arg Asn Ala Leu Pro Ser Val Asn Thr Ala Ser
305 310 315 320

Ser Leu Glu Ile Val
325



<210> 253 
<211> 924 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 253 

atgtcaaacg agaaccacaa ccaaacattc aaagttgccg cggtgcaggc cacacctgta 60
ttcctcgatc gtgaagcgac catcgacaaa gcttgcgagt tgattgctgc agccggcaat 120
gaaggggcga ggctggttgt cttcccggag gcattcatcc cgtcctatcc agattgggta 180
tgggcaatcc caccgggcga agaaggcgtg ctcaatgagt tgtacgcgga actgctctcc 240
aattcggtca cgattcccag tgacgtgacg gatagactgt gccgggccgc gaggcttgcc 300
aatgcctacg tagtgatggg gatgagcgaa tgcaatgccg aggccagtgg cgcaagcctg 360
tataacacgc tattgtacat cgatgcgaag ggtgaaatcc tgggtaaaca tcgaaagttg 420
gtgccaactg gcggcgagcg actggtgtgg gcacagggcg atggcagcac gctgcaggtc 480
tacgatactc cactgggtaa actcggcggt ttaatttgct gggagaatta tatgccgctg 540
gcccgctaca ccatgtacgc ctggggcaca caaatctata tcgcagcgac atgggatcgc 600
gggcaaccct ggctctccac cttgcggcat atcgccaaag aaggcagggt gtacgtgatc 660
ggctgttgta tcgcgatgcg caaagatgat atcccagagc gttacccaat gaagcagaag 720
ttttacgcgg aggccgatga gtggatcaat ataggcgaca gcgcgatcgt caatcctgaa 780
gggcagttta tcgcggggcc ggtacgcaaa caggaagaga ttctctacgc ggagattaat 840
ccgcgcatgg tgcaaggccc gaagtggatg ctcgacgtgg cagggcacta cgccaggccg 900
gacgtattcc agttgacagt gtaa 924



<210> 254 
<211> 307 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
<400> 254

Met Ser Asn Glu Asn His Asn Gln Thr Phe Lys Val Ala Ala Val Gln

1 5 10 15

Ala Thr Pro Val Phe Leu Asp Arg Glu Ala Thr Ile Asp Lys Ala Cys

20 25 30

Glu Leu Ile Ala Ala Ala Gly Asn Glu Gly Ala Arg Leu Val Val Phe

35 40 45

Pro Glu Ala Phe Ile Pro Ser Tyr Pro Asp Trp Val Trp Ala Ile Pro

50 55 60

Pro Gly Glu Glu Gly Val Leu Asn Glu Leu Tyr Ala Glu Leu Leu Ser
65 70 75 80

Asn Ser Val Thr Ile Pro Ser Asp Val Thr Asp Arg Leu Cys Arg Ala

85 90 95

Ala Arg Leu Ala Asn Ala Tyr Val Val Met Gly Met Ser Glu Cys Asn

100 105 110

Ala Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Ile Asp

115 120 125

Ala Lys Gly Glu Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly

130 135 140

Gly Glu Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln Val
145 150 155 160

Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn

165 170 175

Tyr Met Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gln Ile

180 185 190

Tyr Ile Ala Ala Thr Trp Asp Arg Gly Gln Pro Trp Leu Ser Thr Leu

195 200 205

Arg His Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile

210 215 220

Ala Met Arg Lys Asp Asp Ile Pro Glu Arg Tyr Pro Met Lys Gln Lys
225 230 235 240

Phe Tyr Ala Glu Ala Asp Glu Trp Ile Asn Ile Gly Asp Ser Ala Ile

245 250 255

Val Asn Pro Glu Gly Gln Phe Ile Ala Gly Pro Val Arg Lys Gln Glu

260 265 270

Glu Ile Leu Tyr Ala Glu Ile Asn Pro Arg Met Val Gln Gly Pro Lys

275 280 285

Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gln

290 295 300

Leu Thr Val
305

<210> 255 
<211> 1005 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 255
atgacaatgt ctaagaccaa atttagggtt gcagctgtgc aggcggcgcc ggttttcctt 60
gatcgggaag cgacgttgga taaagcttgt ggattgattg aggaggcggg ccgcaacggc 120
gccagcctcg tcgtcttccc tgagtcattc attccggcct accccgattg ggtttgggct 180
gtgccggcgg gcgaagaagc tttactcaat gaactgtacg cacaactgtt ggccaacgcc 240
gttgaaattc ccggcccggc cactcaacgt ttgagccagg cggctaaaaa ggctaaggtt 300
cacctggcta tgggcctgac cgaacgcaac agcgaggcca gcggcggcag cctttacaac 360
accttgctct atcttgaccc gcagggccac attctgggca agcatcgtaa gctggtgccc 420
accggcggtg agcggctggt ttgggcccag ggcgacggca gcactttgca agtttacgat 480
acgcctctgg gtaaactcag cggcctgatt tgctgggaaa attatatgcc gctggcgcgc 540
tacgcgctgt atgcctgggg tacgcaaatt tatattgcgg ccacctggga tcggggtgag 600
ccgtggcttt cgacgttgcg gcatattgcc aaagagggcc gggtgttggt catcggttgc 660
ggtatggcct tgcgcaaggc tgatattcct gatcattttg aattcaagca gcgcttttat 720
caaaacgccg ccgagtggat caacgggggc gacagcgcca ttgtcaaccc tgatggtgaa 780
tttattgctg gccccttaag cgagcaggaa ggcattttgt acgccgagat tgatccggcc 840
cagatgggcg ggccaaagtg gatgctcgac gtggccgggc attacgctcg cccggatgtg 900
tttgaactga cggtccatac cgccgcccga cccatgatca cctcgaaaaa ggatggccta 960
acacccgccg aggccgttac gcaagtaacg aaagcattat tgtaa 1005

<210> 256 
<211> 334 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 256

Met Thr Met Ser Lys Thr Lys Phe Arg Val Ala Ala Val Gln Ala Ala
1 5 10 15

Pro Val Phe Leu Asp Arg Glu Ala Thr Leu Asp Lys Ala Cys Gly Leu
20 25 30

Ile Glu Glu Ala Gly Arg Asn Gly Ala Ser Leu Val Val Phe Pro Glu
35 40 45

Ser Phe Ile Pro Ala Tyr Pro Asp Trp Val Trp Ala Val Pro Ala Gly
50 55 60

Glu Glu Ala Leu Leu Asn Glu Leu Tyr Ala Gln Leu Leu Ala Asn Ala
65 70 75 80

Val Glu Ile Pro Gly Pro Ala Thr Gln Arg Leu Ser Gln Ala Ala Lys
85 90 95

Lys Ala Lys Val His Leu Ala Met Gly Leu Thr Glu Arg Asn Ser Glu
100 105 110

Ala Ser Gly Gly Ser Leu Tyr Asn Thr Leu Leu Tyr Leu Asp Pro Gln
115 120 125

Gly His Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu
130 135 140

Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln Val Tyr Asp
145 150 155 160

Thr Pro Leu Gly Lys Leu Ser Gly Leu Ile Cys Trp Glu Asn Tyr Met
165 170 175

Pro Leu Ala Arg Tyr Ala Leu Tyr Ala Trp Gly Thr Gln Ile Tyr Ile
180 185 190

Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His
195 200 205

Ile Ala Lys Glu Gly Arg Val Leu Val Ile Gly Cys Gly Met Ala Leu
210 215 220

Arg Lys Ala Asp Ile Pro Asp His Phe Glu Phe Lys Gln Arg Phe Tyr
225 230 235 240

Gln Asn Ala Ala Glu Trp Ile Asn Gly Gly Asp Ser Ala Ile Val Asn
245 250 255

Pro Asp Gly Glu Phe Ile Ala Gly Pro Leu Ser Glu Gln Glu Gly Ile
260 265 270

Leu Tyr Ala Glu Ile Asp Pro Ala Gln Met Gly Gly Pro Lys Trp Met
275 280 285

Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu Leu Thr
290 295 300

Val His Thr Ala Ala Arg Pro Met Ile Thr Ser Lys Lys Asp Gly Leu
305 310 315 320

Thr Pro Ala Glu Ala Val Thr Gln Val Thr Lys Ala Leu Leu
325 330

<210> 257 
<211> 942
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 257

atgaacaaga tcgcgatcat tcagcgccca cccgtgctac tcgaccgcat cgccacgctg 60
gcggttgcgg tggagtcgat cgacgaagct gccgcagccg gtgcctcact gatcgttctt 120
ccagaaacct tcatccccgg ctacccgtcc tggatctggc gtctcgcgcc ggggagggat 180
ggtgcgctca ttgcccagtt gcatgcccga ctgctcgcca acgcggtcga tcttgcggct 240
ggagatctgg atgccctgtg tgaagttgcg cacggccacc gggtgaccgt ggtgtgcggc 300
ctcaacgaat gcgagcgcag tcgcggcggg ggcactctct acaacacggt cgtcgtgatc 360
gaccccgacg gcaagctgtg caatcgccac cgcaagctga tgccgaccaa cccggaacgc 420
atggtgcacg gtctgggtga tgcatcgggc ctgcgcgccg tcgacacccc ggtgggtcga 480
gtgggcgcac tcatctgctg ggaaaactat atgccgctgg cacgctacgc actttacgcc 540
gagggggtgg aagtctacgt ggcgcccacc tatgacagcg gcgatggctg gatcagtacg 600
atgcgtcata ttgcgcttga gggacgctgc tgggtgctgg gtagcggaac cgtactgcgt 660
ggcagcgacg tcccagaaga ctttccgtca cacctggacc tgtttcccga cgcggaggaa 720
tggatcaatc cgggcgactc ggtggtcgtc gatcctcagg gcaagatcgt cgcaggcccg 780
atgcgacgtg agacaggcat tctctacgca gaaatcgacg ccgaacgggt cgcgccttcg 840
cgccgcacgc tcgatgtcgc cggacactac gcccgcccgg atattttcga gctccatgtc 900
cgacgtacgc cggcgatgcc ggtccacgcc gttgatgcat ga 942



<210> 258 
<211> 313 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 258

Met Asn Lys Ile Ala Ile Ile Gln Arg Pro Pro Val Leu Leu Asp Arg
1 5 10 15

Ile Ala Thr Leu Ala Val Ala Val Glu Ser Ile Asp Glu Ala Ala Ala
20 25 30

Ala Gly Ala Ser Leu Ile Val Leu Pro Glu Thr Phe Ile Pro Gly Tyr
35 40 45

Pro Ser Trp Ile Trp Arg Leu Ala Pro Gly Arg Asp Gly Ala Leu Ile
50 55 60

Ala Gln Leu His Ala Arg Leu Leu Ala Asn Ala Val Asp Leu Ala Ala
65 70 75 80

Gly Asp Leu Asp Ala Leu Cys Glu Val Ala His Gly His Arg Val Thr
85 90 95

Val Val Cys Gly Leu Asn Glu Cys Glu Arg Ser Arg Gly Gly Gly Thr
100 105 110

Leu Tyr Asn Thr Val Val Val Ile Asp Pro Asp Gly Lys Leu Cys Asn
115 120 125

Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val His Gly
130 135 140

Leu Gly Asp Ala Ser Gly Leu Arg Ala Val Asp Thr Pro Val Gly Arg
145 150 155 160

Val Gly Ala Leu Ile Cys Trp Glu Asn Tyr Met Pro Leu Ala Arg Tyr
165 170 175

Ala Leu Tyr Ala Glu Gly Val Glu Val Tyr Val Ala Pro Thr Tyr Asp
180 185 190

Ser Gly Asp Gly Trp Ile Ser Thr Met Arg His Ile Ala Leu Glu Gly
195 200 205

Arg Cys Trp Val Leu Gly Ser Gly Thr Val Leu Arg Gly Ser Asp Val
210 215 220

Pro Glu Asp Phe Pro Ser His Leu Asp Leu Phe Pro Asp Ala Glu Glu
225 230 235 240

Trp Ile Asn Pro Gly Asp Ser Val Val Val Asp Pro Gln Gly Lys Ile
245 250 255

Val Ala Gly Pro Met Arg Arg Glu Thr Gly Ile Leu Tyr Ala Glu Ile
260 265 270

Asp Ala Glu Arg Val Ala Pro Ser Arg Arg Thr Leu Asp Val Ala Gly
275 280 285

His Tyr Ala Arg Pro Asp Ile Phe Glu Leu His Val Arg Arg Thr Pro
290 295 300

Ala Met Pro Val His Ala Val Asp Ala
305 310























<210> 259 
<211> 981 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 259

atggccatca tcaaagccgc cgccgtccag atcagtccgg tgctgtatag ccgcgaggga 60
accgtggaca aggtctgcca gcagatcatc gccctcggtc agcaaggcgt gcaatttgcg 120
gtctttccgg aaacggtggt gccgtactac ccctatttct ccttcgtcca gccgccgttc 180
gccatgggca aggaacacct gaaactgttg gaacaatcgg tgattgtgcc gtcggctgcc 240
accttggcga tcggcgaagc gtgcaaacaa gcgggaatgg tggtgtctat cggcgtcaat 300
gagcgcgatg gcggcacgat ctacaacgcc cagttgctgt ttgacgctga tggcagcttg 360
attcagcacc gtcgcaaaat aaccccgacg taccacgagc ggatgatctg gggtcaaggc 420
gacggctccg ggttgcgcgc catcgacagc gcggtcgggc gtattggctc gctggcctgc 480
tgggaacatt acaacccctt ggcccgctac gccctgatgg ccgacggcga gcagattcac 540
gcggctatgt ttcccggctc tctggtgggt gacatttttg ccgatcagat agaggtcact 600
attcgtcatc acgccttgga gtccggctgc ttcgtggtca actccaccgc gtggcttgat 660
gctgatcagc aaggccaaat catgcaggac accggttgca gcattggccc aatctcgggt 720
ggctgcttca cggccatcgt ttccccggaa ggcaaattac tcggcgaacc gctgcgttca 780
ggtgagggcg cagtcatcgc cgacctggac atggcattga tcgacaagcg caaacggatg 840
atggattccg tcggccatta cagccgccca gaactgctca gtttattgat cgaccgcacg 900
cccaccgctc atgtgcatga gcgcggcgcc catcaccttg ccgtagcctc tatcggggag 960
cttgaccatg caaaccaatg a 981



<210> 260 
<211> 326 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 260

Met Ala Ile Ile Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Glu Gly Thr Val Asp Lys Val Cys Gln Gln Ile Ile Ala Leu
20 25 30

Gly Gln Gln Gly Val Gln Phe Ala Val Phe Pro Glu Thr Val Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Pro Pro Phe Ala Met Gly Lys
50 55 60

Glu His Leu Lys Leu Leu Glu Gln Ser Val Ile Val Pro Ser Ala Ala
65 70 75 80

Thr Leu Ala Ile Gly Glu Ala Cys Lys Gln Ala Gly Met Val Val Ser
85 90 95

Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Ala Gln Leu
100 105 110

Leu Phe Asp Ala Asp Gly Ser Leu Ile Gln His Arg Arg Lys Ile Thr
115 120 125

Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp Gly Ser Gly
130 135 140

Leu Arg Ala Ile Asp Ser Ala Val Gly Arg Ile Gly Ser Leu Ala Cys
145 150 155 160

Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Asp Gly
165 170 175

Glu Gln Ile His Ala Ala Met Phe Pro Gly Ser Leu Val Gly Asp Ile
180 185 190

Phe Ala Asp Gln Ile Glu Val Thr Ile Arg His His Ala Leu Glu Ser
195 200 205

Gly Cys Phe Val Val Asn Ser Thr Ala Trp Leu Asp Ala Asp Gln Gln
210 215 220

Gly Gln Ile Met Gln Asp Thr Gly Cys Ser Ile Gly Pro Ile Ser Gly
225 230 235 240

Gly Cys Phe Thr Ala Ile Val Ser Pro Glu Gly Lys Leu Leu Gly Glu
245 250 255

Pro Leu Arg Ser Gly Glu Gly Ala Val Ile Ala Asp Leu Asp Met Ala
260 265 270

Leu Ile Asp Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr Ser
275 280 285

Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Thr Ala His
290 295 300

Val His Glu Arg Gly Ala His His Leu Ala Val Ala Ser Ile Gly Glu
305 310 315 320

Leu Asp His Ala Asn Gln
325

<210> 261 
<211> 1014 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 261

atgggtctgg ttcatcagaa atacaaggta gcggtggttc aggcggcacc ggtttttctc 60
gatctcgatg caaccgtgga caagaccatt gcgctgatcg aggaagcttc cgcgcaaggc 120
gcaaaactgg tcgcgtttcc cgagaccttc attcccggat atccgtggca gatctggctc 180
ggcgcgccgg cctgggcgat cggccgcggc tttgtgcagc gctacttcga caattcgctg 240
ggcttcgaca gcccgcaggc ggaaaaaatc cgccaggccg tgaagcgcgc caagctgacc 300
gcggtgcttg ggctatccga acgcgacggc ggcagcctct atatcgcgca gtggctgatc 360
ggccctgacg gcgagaccat cgccaagcgc cggaaactgc gtccgaccca tgccgaacgc 420
accgtgttcg gcgaaggcga cggcagcgat cttgcggtcc acgatcgcgc cgacgtcggg 480
cgcttgggcg cactgtgctg ctgggagcac ctgcagccgc tatcgaaata cgcgatgtac 540
gcccagaacg aacaggtgca tgtcggcgcc tggccgagct tttcgctgta cgatccgttc 600
gcacatgcgc tcggccatga ggtcaacaac gccgccagca aggtctatgc ggtcgaaggc 660
tcctgcttct tcctgggtcc ctgcgctgtc gtctcgcagg cgatgatcga cgagctctgc 720
gactctcccg agaaacatgc cttcctgcat gtcggcggcg gccacgcggt gatctacggc 780
ccggacggca gttcgctggc cgagaaactc ccgcccgacc aggaaggcat tctgtacgcc 840
gatatcgatc tcggcatgat cggggtcgcg aagaacgccg ccgatccggc cgggcattat 900
tcgcggcccg acgtcacgcg gctgttgctc aacaccacgc gcgccaaccg cgtcgagcat 960
ttctcgcttc ccgtcgatgc cgaggtcatg agcgaaatca ggctgcaggc ctga 1014

<210> 262 
<211> 337 
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 262

Met Gly Leu Val His Gln Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15

Pro Val Phe Leu Asp Leu Asp Ala Thr Val Asp Lys Thr Ile Ala Leu
20 25 30

Ile Glu Glu Ala Ser Ala Gln Gly Ala Lys Leu Val Ala Phe Pro Glu
35 40 45

Thr Phe Ile Pro Gly Tyr Pro Trp Gln Ile Trp Leu Gly Ala Pro Ala
50 55 60

Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80

Gly Phe Asp Ser Pro Gln Ala Glu Lys Ile Arg Gln Ala Val Lys Arg
85 90 95

Ala Lys Leu Thr Ala Val Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110

Leu Tyr Ile Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125

Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Phe Gly
130 135 140

Glu Gly Asp Gly Ser Asp Leu Ala Val His Asp Arg Ala Asp Val Gly
145 150 155 160

Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175

Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Gly Ala Trp Pro
180 185 190

Ser Phe Ser Leu Tyr Asp Pro Phe Ala His Ala Leu Gly His Glu Val
195 200 205

Asn Asn Ala Ala Ser Lys Val Tyr Ala Val Glu Gly Ser Cys Phe Phe
210 215 220

Leu Gly Pro Cys Ala Val Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225 230 235 240

Asp Ser Pro Glu Lys His Ala Phe Leu His Val Gly Gly Gly His Ala
245 250 255

Val Ile Tyr Gly Pro Asp Gly Ser Ser Leu Ala Glu Lys Leu Pro Pro
260 265 270

Asp Gln Glu Gly Ile Leu Tyr Ala Asp Ile Asp Leu Gly Met Ile Gly
275 280 285

Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300

Val Thr Arg Leu Leu Leu Asn Thr Thr Arg Ala Asn Arg Val Glu His
305 310 315 320

Phe Ser Leu Pro Val Asp Ala Glu Val Met Ser Glu Ile Arg Leu Gln
325 330 335

Ala



<210> 263 
<211> 1014 
<212> DNA 
<213> Unknown 



<220>

<223> Obtained from an environmental sample

<400> 263

atgaaagtcg tcaaagcggc agcggttcag ttgagccctg tcctctacag ccgcgaggca 60
accgtcgcga aggtcgtgcg gaagatccac gagcttgggc agcagggcgt gcagttcgcc 120
accttcccgg aaaccgttgt gccgtactac ccgtatttct ccgcggtcca gacgccgatg 180
cagcttctgg ctggaaccga gcatctgaaa ttgctcgacc aggccgtgac ggtgccgtct 240
cccgcgaccg acgcgatcgg cgaggcagcc cggaaggcgg gcatggtggt gtccatcggc 300
gtcaacgagc gtgatggtgg aaccctgtac aacacccaat tgctcttcga cgccgatggc 360
accctgatcc agcgccgccg caagatcacg cccacccatt tcgagcgcat gatctggggc 420
cagggtgacg ggtcgggcct gcgtgccgtc gacagcaagg tcggccgcat tggccagctg 480
gcatgcttcg agcacaacaa tcctctggcg cgctacgcga tgatggccga cggcgagcag 540
atccattcgg ccatgtatcc gggttctgcc ttcggcgagg gctttgccca gaggatggaa 600
atcaatatcc gccagcacgc actggagtcc gggtgcttcg tcgtgaacgc gacggcctgg 660
ctggatgccg accagcaggc gcaaatcatg aaagacacgg gctgcgggat cggtccgatc 720
tcgggcggtt gcttcaccac gatcgtggca cccgacggca cgctgctggg ggaacctctg 780
cgctcgggcg agggcgaggt catcgccgat ctcgatttca cggagatcga ccggcgcaag 840
atgctgatgg actcggcagg ccactacaac cgtccggaac tgctcagtct gctgatcgac 900
cgcacgccga ccgcaaacgt gcacgaacgg atggcgcatc cccaagcgag cacgaagcag 960
ccgcgctccg gcgatctgcc cgctgcgctg gctggcgcgc aggagatcct gtga 1014



<210> 264 
<211> 337 
<212> PRT 
<213> Unknown 



<220>

<223> Obtained from an environmental sample

<400> 264

Met Lys Val Val Lys Ala Ala Ala Val Gln Leu Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Glu Ala Thr Val Ala Lys Val Val Arg Lys Ile His Glu Leu
20 25 30

Gly Gln Gln Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Ala Val Gln Thr Pro Met Gln Leu Leu Ala
50 55 60

Gly Thr Glu His Leu Lys Leu Leu Asp Gln Ala Val Thr Val Pro Ser
65 70 75 80

Pro Ala Thr Asp Ala Ile Gly Glu Ala Ala Arg Lys Ala Gly Met Val
85 90 95

Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr
100 105 110

Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125

Ile Thr Pro Thr His Phe Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140

Ser Gly Leu Arg Ala Val Asp Ser Lys Val Gly Arg Ile Gly Gln Leu
145 150 155 160

Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met Met Ala
165 170 175

Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Ala Phe Gly
180 185 190

Glu Gly Phe Ala Gln Arg Met Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205

Glu Ser Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220

Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Gly Ile Gly Pro Ile
225 230 235 240

Ser Gly Gly Cys Phe Thr Thr Ile Val Ala Pro Asp Gly Thr Leu Leu
245 250 255

Gly Glu Pro Leu Arg Ser Gly Glu Gly Glu Val Ile Ala Asp Leu Asp
260 265 270

Phe Thr Glu Ile Asp Arg Arg Lys Met Leu Met Asp Ser Ala Gly His
275 280 285

Tyr Asn Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Thr
290 295 300

Ala Asn Val His Glu Arg Met Ala His Pro Gln Ala Ser Thr Lys Gln
305 310 315 320

Pro Arg Ser Gly Asp Leu Pro Ala Ala Leu Ala Gly Ala Gln Glu Ile
325 330 335

Leu

<210> 265 
<211> 999 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 265

atgcttaSc taggcatagt ccagatgaac gcagagccgc tcaacgtgga aggcaacctg 60
ctcaaggcgg agcgctatgt cgcgaagtgc gccgcggacg gcgcccaact cgtggtgctg 120
ccggagatgt tcaacgtcgg cttccacctc ggcgagtccc tgatgatggt cgccgagccc 180
ctggacggca agaccgtgca gtggctgcaa cggcaggcgt ccacccataa catatatatc 240
accgggagct tatacgagcg ttacgacgag catttctaca acaccatggt catggtggga 300
tacgacggca gcgtgcagta ctaccgcaag cgcaatccta cctggtccga gtcggcggtg 360
tggcgccgca gcgaggtgcc aggccccggt atattcgata ccccgttcgg gcgcatcggg 420
ggcgtcatct gcttcgattc cttcgcgcgc gagacccacg agggcttcaa gcagagcggg 480
gtcgaggcgg tggtaatcat cgccctgtgg ggcgccaacc gtgcgcgggc attcttctgg 540
cgcccggacc tcctgctaag ccgggaaggg ctggtccgtt ggtcccggct ggcctcggag 600
gacgtccccc gaaatcacgc gaaagagctc ggggtcccgg tcgccttcgt caaccagagt 660
agcaccatcc gcatgaccag ccccatccct ttccccgact ggccggtgca gagctccttc 720
tacgacttca tcggcaagtc ccacgtccgg gacgcatccg gagaggtgat cgcgagggtg 780
gacgaggggg agatcgactc ctgcctggtg gtcccggtag aggtcgagca ggcgcagagc 840
aggccggaga tcaggaagtc aaatatatcg cccggctacc tcggcaagga ttactatttc 900
gtggagccgc cgcttatctg caagctcttc caggtctggt tcctcagcgg cctggtgccc 960
accgaatacg aggcgcggcg tctgcgccac ctgttctga 999

<210> 266 

<211> 332 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 266 



35 

Leu Gl} 
50 



65 



Leu 


Gly 
5 


He 


Val 


Gin 


Met 


Asn 
10 


Ala 


Glu 


Pro 


Leu 


Asn 
15 


Val 


Leu 


Leu 


Lys 


Ala 


Glu 


Arg 


Tyr 


Val 


Ala 


Lys 


Cys 


Ala 


Ala 


20 








25 










30 




Phe 


Gin 


Leu 


Val 


Val 


Leu 


Pro 


Glu 


Met 


Phe 


Asn 


Val 


Gly 








40 










45 






Lys 


Glu 


Ser 


Leu 


Met 


Met 


Val 


Ala 


Glu 


Pro 


Leu 


Asp 


Gly 






55 










60 








He 
80 


Trp 


Leu 


Gin 
70 


Arg 


Gin 


Ala 


Ser 


Thr 
75 


His 


Asn 


He 


Tyr 


Leu 


Tyr 
85 


Glu 


Arg 


Tyr 


Asp 


Glu 
90 


His 


Phe 


Tyr 


Asn 


Thr 
95 


Met 


Gly 
100 


Tyr 


Asp 


Gly 


Ser 


Val 
105 


Gin 


Tyr 


Tyr 


Arg 


Lys 
110 


Arg 


Asn 



Pro Thr Trp Ser Glu Ser Ala Val Trp Arg Arg Ser Glu Val Pro Gly
115 120 125

Pro Gly Ile Phe Asp Thr Pro Phe Gly Arg Ile Gly Gly Val Ile Cys
130 135 140

Phe Asp Ser Phe Ala Arg Glu Thr His Glu Gly Phe Lys Gln Ser Gly
145 150 155 160

Val Glu Ala Val Val Ile Ile Ala Leu Trp Gly Ala Asn Arg Ala Arg
165 170 175

Ala Phe Phe Trp Arg Pro Asp Leu Leu Leu Ser Arg Glu Gly Leu Val
180 185 190

Arg Trp Ser Arg Leu Ala Ser Glu Asp Val Pro Arg Asn His Ala Lys
195 200 205

Glu Leu Gly Val Pro Val Ala Phe Val Asn Gln Ser Gly Thr Ile Arg
210 215 220

Met Thr Ser Pro Ile Pro Phe Pro Asp Trp Pro Val Gln Ser Ser Phe
225 230 235 240

Tyr Asp Phe Ile Gly Lys Ser His Val Arg Asp Ala Ser Gly Glu Val
245 250 255

Ile Ala Arg Val Asp Glu Gly Glu Ile Asp Ser Cys Leu Val Val Pro
260 265 270

Val Glu Val Glu Gln Ala Gln Ser Arg Pro Glu Ile Arg Lys Ser Asn
275 280 285

Ile Ser Pro Gly Tyr Leu Gly Lys Asp Tyr Tyr Phe Val Glu Pro Pro
290 295 300

Leu Ile Cys Lys Leu Phe Gln Val Trp Phe Leu Ser Gly Leu Val Pro
305 310 315 320

Thr Glu Tyr Glu Ala Arg Arg Leu Arg His Leu Phe
325 330



<210> 267 
<211> 1038 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 

<400> 267

atgggcatta cgcatccgaa gttcaaggcc gcggcggtac aggcggcgcc gggctttctc 60
gacagcgagg ccaccgtcga caagacgatc cgcctgatgc aggaagcggc ggaccacggc 120
gcctcgctga tcgtctttcc ggaagcctgg ctgcccggtt atccgtggtg gatctggctc 180
ggtccgcccg cctggggcat gcagttcgtg cagcgctact tcgacaattc gccgagcgtc 240
ggcgatgatc ttttccgccg gatcgagcgc gcggccgcca aggcgaagat cgaagtggtc 300
ctcggtctca gcgagcgcgc tgccggctcg ctgtacctcg cgcaggcgtt catctcctca 360
acgggcgaga cgcgcgcagt gcgccgcaag ttgcgaccaa cgcacgtcga gcgaaccgtt 420
ttcggcgagg gcgatggcag cgacttcaag gtgttcgaca ctccgctggg ccgcgtcggt 480
ggtctcttgt gctgggaaca cctgcaaccg ctgtcgcgct acgcgatgtt ctcgatgaac 540
gagcaggtgc acgccgccgc ctggccgacg ttcagcctct acacggattt tgcccatgcc 600
ctcggccacg aactgaatct cgcagccagc gctacttacg cggctgaagg gcagtgctac 660
gtgattgccg cctgtggcgt ggtcacgcag gagatgctgg atctgatgaa ggcgccgtgc 720
cccccggaat atctgcgggt cggcggcgga tacgccatga tctttgcgcc cgacggacgg 780
cgcattgcgg cggcgctgcc gccggaacaa gaagggctga tttacgccga catcgatctt 840
tcgatgatct ctctcgccaa ggcggctgcc gatcccaccg gtcactactc gcggccggat 900
gtcgtgcggc ttatgctgaa taccgaaccg atgcagcggg tcgaaaagct gcagccgccg 960
ctggactcag ccgcgcgccg tgagaatgaa ccggcacgcg agaccgcagc ggcgaccgag 1020
agccgccagc cccagtaa 1038



<210> 268 
<211> 344 
<212> PRT 
<213> Unknown 
<220>

<223> Obtained from an environmental sample

<400> 268

Met Gly Ile Thr His Pro Lys Phe Lys Ala Ala Ala Val Gln Ala Ala
1 5 10 15

Pro Gly Phe Leu Asp Ser Glu Ala Thr Val Asp Lys Thr Ile Arg Leu
20 25 30

Met Gln Glu Ala Ala Asp His Gly Ala Ser Leu Ile Val Phe Pro Glu
35 40 45

Ala Trp Leu Pro Gly Tyr Pro Trp Trp Ile Trp Leu Gly Pro Pro Ala
50 55 60

Trp Gly Met Gln Phe Val Gln Arg Tyr Phe Asp Asn Ser Pro Ser Val
65 70 75 80

Gly Asp Asp Leu Phe Arg Arg Ile Glu Arg Ala Ala Ala Lys Ala Lys
85 90 95

Ile Glu Val Val Leu Gly Leu Ser Glu Arg Ala Ala Gly Ser Leu Tyr
100 105 110

Leu Ala Gln Ala Phe Ile Ser Ser Thr Gly Glu Thr Arg Ala Val Arg
115 120 125

Arg Lys Leu Arg Pro Thr His Val Glu Arg Thr Val Phe Gly Glu Gly
130 135 140

Asp Gly Ser Asp Phe Lys Val Phe Asp Thr Pro Leu Gly Arg Val Gly
145 150 155 160

Gly Leu Leu Cys Trp Glu His Leu Gln Pro Leu Ser Arg Tyr Ala Met
165 170 175

Phe Ser Met Asn Glu Gln Val His Ala Ala Ala Trp Pro Thr Phe Ser
180 185 190

Leu Tyr Thr Asp Phe Ala His Ala Leu Gly His Glu Leu Asn Leu Ala
195 200 205

Ala Ser Ala Thr Tyr Ala Ala Glu Gly Gln Cys Tyr Val Ile Ala Ala
210 215 220

Cys Gly Val Val Thr Gln Glu Met Leu Asp Leu Met Lys Ala Pro Cys
225 230 235 240

Pro Pro Glu Tyr Leu Arg Val Gly Gly Gly Tyr Ala Met Ile Phe Ala
245 250 255

Pro Asp Gly Arg Arg Ile Ala Ala Ala Leu Pro Pro Glu Gln Glu Gly
260 265 270

Leu Ile Tyr Ala Asp Ile Asp Leu Ser Met Ile Ser Leu Ala Lys Ala
275 280 285

Ala Ala Asp Pro Thr Gly His Tyr Ser Arg Pro Asp Val Val Arg Leu
290 295 300

Met Leu Asn Thr Glu Pro Met Gln Arg Val Glu Lys Leu Gln Pro Pro
305 310 315 320

Leu Asp Ser Ala Ala Arg Arg Glu Asn Glu Pro Ala Arg Glu Thr Ala
325 330 335

Ala Ala Thr Glu Ser Arg Gln Pro
340

<210> 269 
<211> 1014 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 269

atgtctgaaa cagccttcaa gatcgcggtc gtacaggcgg ctccggtttt tctcgacgca 60
aaggcgacgg tggacaaggc gatcggtctg atggccgaag ccggcgccaa gggcgccaag 120
ctgctcgcat tcccggaagt attcatcccc ggctaccctt ggtggctgtg gctgggcaca 180
ccggcatggg gcatgcagtt tgttgccaag tatcacgcga actcgcttcg tgcagacggg 240
cctgaattgg cagccctcgc ggcggcggcg gcgaagtccg atatcaatgc cgtcatcggc 300
ttctcggaga tcgacggcgg ttccctctac atcagccagg cgctcatcag cgacaagggc 360
gagataatgt tcaaacggcg caagctgaag ccgacgcacg tcgaacgcac gttgttcggc 420
gaaggggacg ggtccgactt ccaggtcgtg gacacgagcg tcggcaggct cggtgccttg 480
tgttgcgccg aacacataca gccgctgtcg aagtacgcga tgtactccat gcacgaacag 540
gtgcacgtcg cctcctggcc gtcatttact ttgtaccgcg gcacggcata tgccttgggc 600
cacgaggtca atctggccgc gagccagatt tatgcgctcg agggaggctg tttcgtcctt 660
catgcgagcg ccatcaccgg ccaggacatg tttgacgtgc tgtgcgacac tccggagagg 720
acgcaactgc tgaactccga cggcggcaag gtcggcggcg gctactcgat gatcttcggt 780
cccgatggcc agccccttgt tgggcatctg cctcaagaca ccgagggaat actctacgca 840
gatattgacc tggcgaacat ttccgttgcc aaagcggcct acgacccgtc cggacactat 900
gcgcgcggag acgtggtgcg cttgatggtc aatcgcaacc cgcggcatac gagtgttgcg 960
ttcggcgggg gcgccggcga ggcagcaacc tggacggaag caaaagcgga gtga 1014



<210> 270 
<211> 337 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 270

Met Ser Glu Thr Ala Phe Lys Ile Ala Val Val Gln Ala Ala Pro Val
1 5 10 15

Phe Leu Asp Ala Lys Ala Thr Val Asp Lys Ala Ile Gly Leu Met Ala
20 25 30

Glu Ala Gly Ala Lys Gly Ala Lys Leu Leu Ala Phe Pro Glu Val Phe
35 40 45

Ile Pro Gly Tyr Pro Trp Trp Leu Trp Leu Gly Thr Pro Ala Trp Gly
50 55 60

Met Gln Phe Val Ala Lys Tyr His Ala Asn Ser Leu Arg Ala Asp Gly
65 70 75 80

Pro Glu Leu Ala Ala Leu Ala Ala Ala Ala Ala Lys Ser Asp Ile Asn
85 90 95

Ala Val Ile Gly Phe Ser Glu Ile Asp Gly Gly Ser Leu Tyr Ile Ser
100 105 110

Gln Ala Leu Ile Ser Asp Lys Gly Glu Ile Met Phe Lys Arg Arg Lys
115 120 125

Leu Lys Pro Thr His Val Glu Arg Thr Leu Phe Gly Glu Gly Asp Gly
130 135 140

Ser Asp Phe Gln Val Val Asp Thr Ser Val Gly Arg Leu Gly Ala Leu
145 150 155 160

Cys Cys Ala Glu His Ile Gln Pro Leu Ser Lys Tyr Ala Met Tyr Ser
165 170 175

Met His Glu Gln Val His Val Ala Ser Trp Pro Ser Phe Thr Leu Tyr
180 185 190

Arg Gly Thr Ala Tyr Ala Leu Gly His Glu Val Asn Leu Ala Ala Ser
195 200 205

Gln Ile Tyr Ala Leu Glu Gly Gly Cys Phe Val Leu His Ala Ser Ala
210 215 220

Ile Thr Gly Gln Asp Met Phe Asp Val Leu Cys Asp Thr Pro Glu Arg
225 230 235 240

Thr Gln Leu Leu Asn Ser Asp Gly Gly Lys Val Gly Gly Gly Tyr Ser
245 250 255

Met Ile Phe Gly Pro Asp Gly Gln Pro Leu Val Gly His Leu Pro Gln
260 265 270

Asp Thr Glu Gly Ile Leu Tyr Ala Asp Ile Asp Leu Ala Asn Ile Ser
275 280 285

Val Ala Lys Ala Ala Tyr Asp Pro Ser Gly His Tyr Ala Arg Gly Asp
290 295 300

Val Val Arg Leu Met Val Asn Arg Asn Pro Arg His Thr Ser Val Ala
305 310 315 320

Phe Gly Gly Gly Ala Gly Glu Ala Ala Thr Trp Thr Glu Ala Lys Ala
325 330 335

Glu



<210> 271 
<211> 966 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 271

atgtccagcg agaataacaa cgctacattc aaagttgccg cagttcaggc cacacctgtg 60
tatcttgatc gtgaagcaac catcgacaag gcttgcgagt tgatcgctac tgctggcagc 120
gaaggagctc gcctgattat ctttccagaa gcgttcatcc caacctatcc tgagtgggta 180
tggggtattc cttctggtga gcaaggttta ctcaacgagc tctattcaga gttgctcacc 240
aattcggtca cgattcccag cgacgcgact gacagactgt gcgaggccgc gaagcttgct 300
aatgcctacg tggtgatggg aatgagtgaa cggaatgtcg aagcgagtgg tgcaagcctg 360
tataatacgc tcttgtacat agatgcgcag ggggagattt tagggaaaca tcgaaagttg 420
gtaccaacgg gcggtgagcg cctggtatgg gcgcaaggtg atggcagcac gctgcaggtc 480
tacgatactc cattgggaaa actcggtggt ttaatttgct gggaaaatta tatgccactg 540
gcacgctacg ctatgtatgc ctggggtaca caaatctatg tcgcagcaac gtgggatcgc 600
ggccaaccct ggctctcaac gttacggcat attgccaaag aaggcagggt atacgtaatt 660
ggttgctgta ttgcgatgcg taaagatgat attccagatc gttactccat gaagcagaag 720
tattacgctg aaatggagga atggattaat attggtgaca gcgcgattgt caatcccgaa 780
ggacacttta ttgcagggcc tgtgcgcaag caagaagaaa ttctttacgc ggagatcgat 840
ccacgcatgg tgcaaggccc gaagtggatg ctcgatgtgg ctgggcacta tgcgagacca 900
gatgtgttcc agttgacggt gcatacggat gtgaggcaga tgattcgggt ggaacatgat 960
tcataa 966



<210> 272 
<211> 321 
<212> PRT 
<213> Unknown 



<220>

<223> Obtained from an environmental sample

<400> 272

Met Ser Ser Glu Asn Asn Asn Ala Thr Phe Lys Val Ala Ala Val Gln
1 5 10 15

Ala Thr Pro Val Tyr Leu Asp Arg Glu Ala Thr Ile Asp Lys Ala Cys
20 25 30

Glu Leu Ile Ala Thr Ala Gly Ser Glu Gly Ala Arg Leu Ile Ile Phe
35 40 45

Pro Glu Ala Phe Ile Pro Thr Tyr Pro Glu Trp Val Trp Gly Ile Pro
50 55 60

Ser Gly Glu Gln Gly Leu Leu Asn Glu Leu Tyr Ser Glu Leu Leu Thr
65 70 75 80

Asn Ser Val Thr Ile Pro Ser Asp Ala Thr Asp Arg Leu Cys Glu Ala
85 90 95

Ala Lys Leu Ala Asn Ala Tyr Val Val Met Gly Met Ser Glu Arg Asn
100 105 110

Val Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Ile Asp
115 120 125

Ala Gln Gly Glu Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly
130 135 140

Gly Glu Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln Val
145 150 155 160

Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn
165 170 175

Tyr Met Pro Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Thr Gln Ile
180 185 190

Tyr Val Ala Ala Thr Trp Asp Arg Gly Gln Pro Trp Leu Ser Thr Leu
195 200 205

Arg His Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile
210 215 220

Ala Met Arg Lys Asp Asp Ile Pro Asp Arg Tyr Ser Met Lys Gln Lys
225 230 235 240

Tyr Tyr Ala Glu Met Glu Glu Trp Ile Asn Ile Gly Asp Ser Ala Ile
245 250 255

Val Asn Pro Glu Gly His Phe Ile Ala Gly Pro Val Arg Lys Gln Glu
260 265 270

Glu Ile Leu Tyr Ala Glu Ile Asp Pro Arg Met Val Gln Gly Pro Lys
275 280 285

Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gln
290 295 300

Leu Thr Val His Thr Asp Val Arg Gln Met Ile Arg Val Glu His Asp
305 310 315 320

Ser



<210> 273 
<211> 1023 
<212> DNA
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 273

atgaataact caccaccaac catccgcgct gctgccattc agctcagtcc agttctgttt 60
agtcgggatg ggactaccga gaaggtgttg caggcgatcg ctagtgctgc caaggaaggg 120
gcacaactgg ttgtttttcc agaaactttc atcccctact acccctattt ctcattcatt 180
caaccccctg tgttgatggg caaagagcac atgcggctct atgaggaagc cgtgacggtc 240
ccgggtccgg tgacagatgc ggtcagtcga gcagcccgtt cttacggcat ggtggtagtg 300
ctgggggtga atgagcgaga tggtggctca atttacaata cacagttgat tttcgatgct 360
gacggcacat tgttgctgaa gcgacgcaaa atcaccccta cctatcatga gcgcatggtc 420
tgggggcagg gagacggtgc tggattgaag gtattggata cagcagtcgg taaggtgggt 480
gcgctggcat gttgggaaca ttacaatccc ctggcacgat ttgcgctgat ggcacagcat 540
gagcagattc actgcgctca gttccccggt tctctggtgg gacaaatttt cactgatcag 600
attgaggtaa cgattcggca tcatgcgttg gaatcgggtt gttttgtggt gaatgctact 660
ggctggctct ctccagaaca ggtggcacaa atcaccacgg atgaaaagtt gcaacgggtg 720
ctgagtggcg ggtgtaatac cgccattatt ggacctgaag gcaatcatct ctgtcctccc 780
attaccgatg gtgagggcat agcgatcgcc gatctcgact tctcactaat caccaaacgc 840
aaacgcatga tggattgcgt cggtcactac tcccgccctg acttgttgaa gctgcaactc 900
aatgcaacgg catggtcggt gctggctggg gagcaggggg caggtgccag ggagcagggg 960
ctaggtgtgc cggatgccat gctgtctacg cctaagccag aatactcaac actggatcag 1020
tag 1023



<210> 274 
<211> 340 
<212> PRT 
<213> Unknown 



<220> 
<223> Obtained from an environmental sample 
<400> 274 

Met Asn Asn Ser Pro Pro Thr Ile Arg Ala Ala Ala Ile Gln Leu Ser 
1 5 10 15 

Pro Val Leu Phe Ser Arg Asp Gly Thr Thr Glu Lys Val Leu Gln Ala 
20 25 30 

Ile Ala Ser Ala Ala Lys Glu Gly Ala Gln Leu Val Val Phe Pro Glu 
35 40 45 

Thr Phe Ile Pro Tyr Tyr Pro Tyr Phe Ser Phe Ile Gln Pro Pro Val 
50 55 60 

Leu Met Gly Lys Glu His Met Arg Leu Tyr Glu Glu Ala Val Thr Val 
65 70 75 80 

Pro Gly Pro Val Thr Asp Ala Val Ser Arg Ala Ala Arg Ser Tyr Gly 
85 90 95 

Met Val Val Val Leu Gly Val Asn Glu Arg Asp Gly Gly Ser Ile Tyr 
100 105 110 

Asn Thr Gln Leu Ile Phe Asp Ala Asp Gly Thr Leu Leu Leu Lys Arg 
115 120 125 

Arg Lys Ile Thr Pro Thr Tyr His Glu Arg Met Val Trp Gly Gln Gly 
130 135 140 

Asp Gly Ala Gly Leu Lys Val Leu Asp Thr Ala Val Gly Lys Val Gly 
145 150 155 160 

Ala Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Phe Ala Leu 
165 170 175 

Met Ala Gln His Glu Gln Ile His Cys Ala Gln Phe Pro Gly Ser Leu 
180 185 190 

Val Gly Gln Ile Phe Thr Asp Gln Ile Glu Val Thr Ile Arg His His 
195 200 205 

Ala Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Gly Trp Leu Ser 
210 215 220 

Pro Glu Gln Val Ala Gln Ile Thr Thr Asp Glu Lys Leu Gln Arg Val 
225 230 235 240 

Leu Ser Gly Gly Cys Asn Thr Ala Ile Ile Gly Pro Glu Gly Asn His 
245 250 255 

Leu Cys Pro Pro Ile Thr Asp Gly Glu Gly Ile Ala Ile Ala Asp Leu 
260 265 270 

Asp Phe Ser Leu Ile Thr Lys Arg Lys Arg Met Met Asp Cys Val Gly 
275 280 285 

His Tyr Ser Arg Pro Asp Leu Leu Lys Leu Gln Leu Asn Ala Thr Ala 
290 295 300 

Trp Ser Val Leu Ala Gly Glu Gln Gly Ala Gly Ala Arg Glu Gln Gly 
305 310 315 320 

Leu Gly Val Pro Asp Ala Met Leu Ser Thr Pro Lys Pro Glu Tyr Ser 
325 330 335 

Thr Leu Asp Gln 
340 

<210> 275 
<211> 849 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 275 

atggagacgg ctcacaaagc aaaggtcgat ttccttgtgc tgggtgagac gtggctctca 60 
ggctacccgg cttggctgga ccactgcccc gatgttggcc ggtgggatta tgaaccgatg 120 
aaaaaagtgt atttgagatt tcgacaaagt gctatttctg ttcctggcaa agaatttgat 180 
ttccttactg gcctctgtaa aaaatattca caaacgcttg ccatcggtgt taatgagaaa 240 
gtagatcatg gggtaggtaa tggtaccatt tataattcat ttctactgat tgattctgat 300 
ggaacactgt tgaatcatca tcgcaagtta gttcccactt ttactgagaa attattatac 360 
ggccatggag atggccatgg gctgaagtcg atggatactt cggtgggaag aatcggaggg 420 
agcatttgtt gggaacattg gatgccacta tgcagacaag cacttcatga tgcaggtgag 480 
caaatccatg ttgccctttg gccgactgtt catgacatcc atcaagtggc aagtagaagc 540 
tatgcatttg aagggcgctg ctttgtattg gctgccgggc agatttttgc tgctaaagat 600 
tttccaaagg aacttgtctt accagactat ctaaagcaaa atccggatca gctcattttg 660 
aatgggggga gctgcgtgat cggccctgat gggaaatatt tgattgagcc cgtgtttgat 720 
cgggaagaac tgattgtgtg tgaacttgac cttgacgaag cttataaaga aagaatgacg 780 
atggacgttt caggtcacta ccaaagacga gacgttttca gttttgacgt gaaccaacat 840 
cgacattga 849 



<210> 276 
<211> 310 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 276 

Met Thr Lys Leu Lys Ile Ala Ile Gly Gln Phe Ser Ser Asn His Leu 
1 5 10 15 

Asp Leu Lys Cys Ser Leu Glu Lys Leu Glu Lys Ile Met Glu Thr Ala 
20 25 30 

His Lys Ala Lys Val Asp Phe Leu Val Leu Gly Glu Thr Trp Leu Ser 
35 40 45 

Gly Tyr Pro Ala Trp Leu Asp His Cys Pro Asp Val Gly Arg Trp Asp 
50 55 60 

Tyr Glu Pro Met Lys Lys Val Tyr Leu Arg Phe Arg Gln Ser Ala Ile 
65 70 75 80 

Ser Val Pro Gly Lys Glu Phe Asp Phe Leu Thr Gly Leu Cys Lys Lys 
85 90 95 

Tyr Ser Gln Thr Leu Ala Ile Gly Val Asn Glu Lys Val Asp His Gly 
100 105 110 

Val Gly Asn Gly Thr Ile Tyr Asn Ser Phe Leu Leu Ile Asp Ser Asp 
115 120 125 

Gly Thr Leu Leu Asn His His Arg Lys Leu Val Pro Thr Phe Thr Glu 
130 135 140 

Lys Leu Leu Tyr Gly His Gly Asp Gly His Gly Leu Lys Ser Met Asp 
145 150 155 160 

Thr Ser Val Gly Arg Ile Gly Gly Ser Ile Cys Trp Glu His Trp Met 
165 170 175 

Pro Leu Cys Arg Gln Ala Leu His Asp Ala Gly Glu Gln Ile His Val 
180 185 190 

Ala Leu Trp Pro Thr Val His Asp Ile His Gln Val Ala Ser Arg Ser 
195 200 205 

Tyr Ala Phe Glu Gly Arg Cys Phe Val Leu Ala Ala Gly Gln Ile Phe 
210 215 220 

Ala Ala Lys Asp Phe Pro Lys Glu Leu Val Leu Pro Asp Tyr Leu Lys 
225 230 235 240 

Gln Asn Pro Asp Gln Leu Ile Leu Asn Gly Gly Ser Cys Val Ile Gly 
245 250 255 

Pro Asp Gly Lys Tyr Leu Ile Glu Pro Val Phe Asp Arg Glu Glu Leu 
260 265 270 

Ile Val Cys Glu Leu Asp Leu Asp Glu Ala Tyr Lys Glu Arg Met Thr 
275 280 285 

Met Asp Val Ser Gly His Tyr Gln Arg Arg Asp Val Phe Ser Phe Asp 
290 295 300 

Val Asn Gln His Arg His 
305 310 



<210> 277 
<211> 1056 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 277 

atgccaaccc ccagcgatca tttcaaaatc gccgctgttc aggcctcgcc cgtgtttctg 60 
gaccgggagg ccactgtgga aaaggcctgc cggttgatcg ccgaagccgc aaagcagggc 120 
gcccgcctca tcgtctttcc ggaatctttc atcccgacct acccggactg ggtatgggcc 180 
gttcccccgg gaagggaaag aatcctgaac cagctgtatt ctgaattcct ggccaatgcc 240 
gtcgatgttc ccggcgcggc gaccgaacaa cttgcccagg ctgcacgaat ggccggcgcc 300 
tatgtgatta tgggcgtcac cgaaagagac acctcggcca gcggggccag cctctacaac 360 
accctgctct acttcagccc cgaaggcatc ctaatgggca aacaccggaa gctggttccc 420 
acggggggcg aacggctggt ctgggcctac ggagacggca gcacgctgga ggtctacgac 480 
actccgctgg gaaagatcgg cgggctgatc tgctgggaga actacatgcc cctggcccgg 540 
tacacgatgt acgcctgggg cacccagatt tacatcgccg ccacctggga ccgcggggaa 600 
ccgtggctct ccaccctgcg gcatatcgcc aaggaaggaa gggtctacgt catcgggtgc 660 
tgcatcgccc tgcgccaggg ggatatcccg gaccggttcg agtacaaggg aaaattttat 720 
tccgggtccc gggagtggat caatgagggc gacagcgcca tcgtgaaccc ggacggggaa 780 
ttcatcgccg ggccggtgcg gatgaaggag gagatcctgt atgccgagat agacccccgg 840 
cagatgcggg gccccaagtg gatgctcgat gtggccggtc attacgcccg gccggatatc 900 
ttcgagctca tcgtccaccg gaatccccac ccgatgatca aaatcgccga agacaggggc 960 
gcggggatcg cctcaagttt gattcgcccc cgccctaacc ttcccccatc aagggggagg 1020 
aaatcggcaa gaagcaaacg caagcccaaa aaatga 1056 

<210> 278 
<211> 351 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 278 



Met Pro Thr Pro Ser Asp His Phe Lys Ile Ala Ala Val Gln Ala Ser 
1 5 10 15 

Pro Val Phe Leu Asp Arg Glu Ala Thr Val Glu Lys Ala Cys Arg Leu 
20 25 30 

Ile Ala Glu Ala Ala Lys Gln Gly Ala Arg Leu Ile Val Phe Pro Glu 
35 40 45 

Ser Phe Ile Pro Thr Tyr Pro Asp Trp Val Trp Ala Val Pro Pro Gly 
50 55 60 

Arg Glu Arg Ile Leu Asn Gln Leu Tyr Ser Glu Phe Leu Ala Asn Ala 
65 70 75 80 

Val Asp Val Pro Gly Ala Ala Thr Glu Gln Leu Ala Gln Ala Ala Arg 
85 90 95 

Met Ala Gly Ala Tyr Val Ile Met Gly Val Thr Glu Arg Asp Thr Ser 
100 105 110 

Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Phe Ser Pro Glu 
115 120 125 

Gly Ile Leu Met Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu 
130 135 140 

Arg Leu Val Trp Ala Tyr Gly Asp Gly Ser Thr Leu Glu Val Tyr Asp 
145 150 155 160 

Thr Pro Leu Gly Lys Ile Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met 
165 170 175 

Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gln Ile Tyr Ile 
180 185 190 

Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His 
195 200 205 

Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile Ala Leu 
210 215 220 

Arg Gln Gly Asp Ile Pro Asp Arg Phe Glu Tyr Lys Gly Lys Phe Tyr 
225 230 235 240 

Ser Gly Ser Arg Glu Trp Ile Asn Glu Gly Asp Ser Ala Ile Val Asn 
245 250 255 

Pro Asp Gly Glu Phe Ile Ala Gly Pro Val Arg Met Lys Glu Glu Ile 
260 265 270 

Leu Tyr Ala Glu Ile Asp Pro Arg Gln Met Arg Gly Pro Lys Trp Met 
275 280 285 

Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Ile Phe Glu Leu Ile 
290 295 300 

Val His Arg Asn Pro His Pro Met Ile Lys Ile Ala Glu Asp Arg Gly 
305 310 315 320 

Ala Gly Ile Ala Ser Ser Leu Ile Arg Pro Arg Pro Asn Leu Pro Pro 
325 330 335 

Ser Arg Gly Arg Lys Ser Ala Arg Ser Lys Arg Lys Pro Lys Lys 
340 345 350 







<210> 279 
<211> 1014 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 279 

atgggcatta cccatcccaa atacaaagtc gctgcagtcc aggccgcgcc ggtctggctg 60 
gacctggatg ccacggtgga caagtgcatc cgcctgatcc aggaggccgc tgacaagggc 120 
tgcaagctga tcgcgtttcc ggagacgttc attcccggct atccctggca catctggatg 180 
ggtgcgccgg cctgggccat cggccggggc tttgtgcagc ggtatttcga caactccctg 240 
tcctatgaca gcccgcaggc cgaaaagctg cgccaggccg tcaaggccgc aggcatcacc 300 
gcgtcgctgg gactgtcgga gcgctcgggc ggcagtctct acatcgcgca gtggctcatt 360 
ggccccgacg gcgaaacgat ctcgcagcgg cgtaagctgc ggcccacgca ttccgaacgc 420 
accgtcttcg gcgacggcga tgggagcgac ctcaaggtgc acgacacgcc gctgggccgt 480 
gtgggtgagc tggcgtgctg ggagaacatc ctgtcgctga acaagtacgc catgttctcg 540 
cagcacgagc aggtgcacat tgcagcctgg cctagcttct ccacctacga gccctttgca 600 
catgccctgg gctgggaagt gaacaacgca gtcagcaagg tgtacgcagt ggaaggcggg 660 
tgtttcgtgg tggcaccctg cgccatcatt tccaaggaga tggtcgatga actgtgcgac 720 
accccggaca agcacaccct gacccatgtg ggaggcggcc acgcggtgat ctacgggccg 780 
gatggcgccc ccctggcgga caagctgcct gaagacgccg agggcctgct gatcgcggag 840 
atcgacctcg ggatgattgg cgtggccaag aacgccatgg acccggtagg gcactactcg 900 
cgccccgacg tgcaccgcct gcttctcaac cgcaagaagg cagcgcaggt ggagcacttt 960 
gcgctgccgg tggacgccat cgattccgag ccccaagcca ccacggcgca ctga 1014 

<210> 280 
<211> 337 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 280 

Met Gly Ile Thr His Pro Lys Tyr Lys Val Ala Ala Val Gln Ala Ala 
1 5 10 15 

Pro Val Trp Leu Asp Leu Asp Ala Thr Val Asp Lys Cys Ile Arg Leu 
20 25 30 

Ile Gln Glu Ala Ala Asp Lys Gly Cys Lys Leu Ile Ala Phe Pro Glu 
35 40 45 

Thr Phe Ile Pro Gly Tyr Pro Trp His Ile Trp Met Gly Ala Pro Ala 
50 55 60 

Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu 
65 70 75 80 

Ser Tyr Asp Ser Pro Gln Ala Glu Lys Leu Arg Gln Ala Val Lys Ala 
85 90 95 

Ala Gly Ile Thr Ala Ser Leu Gly Leu Ser Glu Arg Ser Gly Gly Ser 
100 105 110 

Leu Tyr Ile Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ser 
115 120 125 

Gln Arg Arg Lys Leu Arg Pro Thr His Ser Glu Arg Thr Val Phe Gly 
130 135 140 

Asp Gly Asp Gly Ser Asp Leu Lys Val His Asp Thr Pro Leu Gly Arg 
145 150 155 160 

Val Gly Glu Leu Ala Cys Trp Glu Asn Ile Leu Ser Leu Asn Lys Tyr 
165 170 175 

Ala Met Phe Ser Gln His Glu Gln Val His Ile Ala Ala Trp Pro Ser 
180 185 190 

Phe Ser Thr Tyr Glu Pro Phe Ala His Ala Leu Gly Trp Glu Val Asn 
195 200 205 

Asn Ala Val Ser Lys Val Tyr Ala Val Glu Gly Gly Cys Phe Val Val 
210 215 220 

Ala Pro Cys Ala Ile Ile Ser Lys Glu Met Val Asp Glu Leu Cys Asp 
225 230 235 240 

Thr Pro Asp Lys His Thr Leu Thr His Val Gly Gly Gly His Ala Val 
245 250 255 

Ile Tyr Gly Pro Asp Gly Ala Pro Leu Ala Asp Lys Leu Pro Glu Asp 
260 265 270 

Ala Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Met Ile Gly Val 
275 280 285 

Ala Lys Asn Ala Met Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val 
290 295 300 

His Arg Leu Leu Leu Asn Arg Lys Lys Ala Ala Gln Val Glu His Phe 
305 310 315 320 

Ala Leu Pro Val Asp Ala Ile Asp Ser Glu Pro Gln Ala Thr Thr Ala 
325 330 335 

His 



<210> 281 
<211> 936 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 281 

atgtccaaga tcgccgtcgt ccaagagcct ccggtgctgc tcgatcgcgc cgccaccctc 60 
gagcgcgccg tctcggccat cgagcgggcg gccgacgtgg gggcgcacct cgtcgtgttc 120 
cccgagacgt acgtcccggg gtaccccgac tgggtctggc ggacgcgccc cgacgacttc 180 
aagctcgcgg gcgcgctgca cgagcgcctc ctcgcgaacg cggtcgacct cgaaaaggac 240 
cagctcgcgc cgctccgcga ggccgcgcgc cggcgaggcg tcaccatcgc gtgcggcgtg 300 
aacgagcgcg aggggagcca cggccgcgcg accctctaca acaccgtcgt cgtcgtcgga 360 
cccgacggcg cgatcctcaa ccgccaccgc aagctcgtcc ccacgaaccc cgagcgcatg 420 
gtctggggcc cgggtgacgc gagcgggctg cgcgtcgtcg acacgccggc cggccgcgtg 480 
ggggcgctca tctgctggga gaactacatg ccgctcgcgc gcttcgcgct ctacgcgcag 540 
ggcgtcgagg tctacctcgc gccgacgtgg gatcacggcg acacttggct cgcctccatg 600 
cggcacatcg cgcgcgaggc gcgcgcctgg gtcgtctcgg gggccatctg catgcaggcg 660 
aaggacgtcc ccgccgactt cccgcagcgc gcggcgatct accccgacga ggaggagtgg 720 
ctcaaccccg gcgatgccgt cgtcgtcgat cccaccggcg ccgtcgccgc cggcccgctg 780 
caccgcgagc gcggcatcct ctacgccgag tgcgatcccg cgcgggcgtc gctcgcccgc 840 
cgcacgctcg acgtctccgg gcactacgga cggcccgacg tctttcacct gcagatcgac 900 
cgcacaccgc gcgtgccggc gtcgttccgg gactga 936 

<210> 282 
<211> 311 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 282 

Met Ser Lys Ile Ala Val Val Gln Glu Pro Pro Val Leu Leu Asp Arg 
1 5 10 15 

Ala Ala Thr Leu Glu Arg Ala Val Ser Ala Ile Glu Arg Ala Ala Asp 
20 25 30 

Val Gly Ala His Leu Val Val Phe Pro Glu Thr Tyr Val Pro Gly Tyr 
35 40 45 

Pro Asp Trp Val Trp Arg Thr Arg Pro Asp Asp Phe Lys Leu Ala Gly 
50 55 60 

Ala Leu His Glu Arg Leu Leu Ala Asn Ala Val Asp Leu Glu Lys Asp 
65 70 75 80 

Gln Leu Ala Pro Leu Arg Glu Ala Ala Arg Arg Arg Gly Val Thr Ile 
85 90 95 

Ala Cys Gly Val Asn Glu Arg Glu Gly Ser His Gly Arg Ala Thr Leu 
100 105 110 

Tyr Asn Thr Val Val Val Val Gly Pro Asp Gly Ala Ile Leu Asn Arg 
115 120 125 

His Arg Lys Leu Val Pro Thr Asn Pro Glu Arg Met Val Trp Gly Pro 
130 135 140 

Gly Asp Ala Ser Gly Leu Arg Val Val Asp Thr Pro Ala Gly Arg Val 
145 150 155 160 

Gly Ala Leu Ile Cys Trp Glu Asn Tyr Met Pro Leu Ala Arg Phe Ala 
165 170 175 

Leu Tyr Ala Gln Gly Val Glu Val Tyr Leu Ala Pro Thr Trp Asp His 
180 185 190 

Gly Asp Thr Trp Leu Ala Ser Met Arg His Ile Ala Arg Glu Ala Arg 
195 200 205 

Ala Trp Val Val Ser Gly Ala Ile Cys Met Gln Ala Lys Asp Val Pro 
210 215 220 

Ala Asp Phe Pro Gln Arg Ala Ala Ile Tyr Pro Asp Glu Glu Glu Trp 
225 230 235 240 

Leu Asn Pro Gly Asp Ala Val Val Val Asp Pro Thr Gly Ala Val Ala 
245 250 255 

Ala Gly Pro Leu His Arg Glu Arg Gly Ile Leu Tyr Ala Glu Cys Asp 
260 265 270 

Pro Ala Arg Ala Ser Leu Ala Arg Arg Thr Leu Asp Val Ser Gly His 
275 280 285 

Tyr Gly Arg Pro Asp Val Phe His Leu Gln Ile Asp Arg Thr Pro Arg 
290 295 300 

Val Pro Ala Ser Phe Arg Asp 
305 310 

<210> 283 

<211> 1017 

<212> DNA 

<213> Unknown 
<220> 

<223> Obtained from an environmental sample 

<400> 283 

atgggcatcg aacatccgaa gtacaaggtc gcggtggtgc aggcggcacc ggcctggctc 60 
gatctcgacg cgtcgatcga caagaccatc gggctgatcg aggaggccgc ccaaaaaggc 120 
gccaagctga ttgcattccc cgaggccttc atccccggtt acccctggca catctggatg 180 
gactcgccgg cctgggcgat cggccgcggt ttcgtgcagc gctattttga caattcgctc 240 
gcctatgaca gcccgcaggc cgagaaactg cgcgcggcgg ttcgcaaggc aaagctcacg 300 
gccgtgatcg ggctgtccga gcgcgacggc ggcagtcttt acctcgcgca atggctgatc 360 
gggcccgacg gtgagaccat cgccaagcgc cgcaagctgc ggccgacaca tgcggagcgc 420 
acggtatacg gcgagggcga cggcagcgac ctcgcggttc acaaccgtcc ggacattggc 480 
cgccttggcg cgctctgctg ctgggagcat ctccagccgc tgtcgaaata cgcgatgtac 540 
gcgcagaacg agcaggtgca tgtcgcggcc tggccgagct tttcgctgta cgatcccttc 600 
gcggtggcgc tcggcgccga agtgaacaac gcggcctcgc gcgtctatgc ggtcgaaggc 660 
tcctgcttcg tgctggcgcc gtgcgcgacg gtctcgcagg ccatgatcga cgaactctgc 720 
gaccggccgg acaagcacgc gctgttgcat gtcggcggcg gttttgccgc gatctacggt 780 
cctgacggca gccagatcgg cgacaaactc gctcccgacc aggaagggtt gctgatcgcg 840 
gagatcgatc tcggcgccat cggcgtcgcc aagaatgcgg cggatcccgc cgggcattat 900 
tcgcggcctg acgtgacgcg actgttgctc aacaagaagc cgtacaagcg cgtcgagcag 960 
ttttcgccgc cggccgaggc gctcgagccg acggatatcg cagcagcagc aagctaa 1017 



<210> 284 
<211> 338 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 284 

Met Gly Ile Glu His Pro Lys Tyr Lys Val Ala Val Val Gln Ala Ala 
1 5 10 15 

Pro Ala Trp Leu Asp Leu Asp Ala Ser Ile Asp Lys Thr Ile Gly Leu 
20 25 30 

Ile Glu Glu Ala Ala Gln Lys Gly Ala Lys Leu Ile Ala Phe Pro Glu 
35 40 45 

Ala Phe Ile Pro Gly Tyr Pro Trp His Ile Trp Met Asp Ser Pro Ala 
50 55 60 

Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu 
65 70 75 80 

Ala Tyr Asp Ser Pro Gln Ala Glu Lys Leu Arg Ala Ala Val Arg Lys 
85 90 95 

Ala Lys Leu Thr Ala Val Ile Gly Leu Ser Glu Arg Asp Gly Gly Ser 
100 105 110 

Leu Tyr Leu Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala 
115 120 125 

Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly 
130 135 140 

Glu Gly Asp Gly Ser Asp Leu Ala Val His Asn Arg Pro Asp Ile Gly 
145 150 155 160 

Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys 
165 170 175 

Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro 
180 185 190 

Ser Phe Ser Leu Tyr Asp Pro Phe Ala Val Ala Leu Gly Ala Glu Val 
195 200 205 

Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val 
210 215 220 

Leu Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys 
225 230 235 240 

Asp Arg Pro Asp Lys His Ala Leu Leu His Val Gly Gly Gly Phe Ala 
245 250 255 

Ala Ile Tyr Gly Pro Asp Gly Ser Gln Ile Gly Asp Lys Leu Ala Pro 
260 265 270 

Asp Gln Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Ala Ile Gly 
275 280 285 

Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp 
290 295 300 

Val Thr Arg Leu Leu Leu Asn Lys Lys Pro Tyr Lys Arg Val Glu Gln 
305 310 315 320 

Phe Ser Pro Pro Ala Glu Ala Leu Glu Pro Thr Asp Ile Ala Ala Ala 
325 330 335 

Ala Ser 



<210> 285 
<211> 918 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 285 

atgaatacta atctagtaaa ggtcgcggcg gctcaagttg ctccccattt tctcaatttg 60 
agcaatacgg tggaaaaaac ctgcaactta atttctgaag caggcaaaaa tggagcaaag 120 
ctaatcgtat ttccagaagc cttcatctct ggttatcccg attgggtctg gcttattccc 180 
aatgcgaatt ctgcaatgct ggatgattta taccaggaat tggttgagaa cgcagtaacg 240 
atccctgata caacaacaca taaactatgt caggctgcaa aagatgcagg ggtatatgtt 300 
gcggtaggta tacatgagag aaattcagaa gcaagtggct tcacgctttt caatacgctt 360 
ctatacatca acgatcaagg cgtaatcatc ggaaaacacc gaaaattaat ccctacaggg 420 
ggcgaacggc tggtctgggg gcagggtaat ggggatacac tttctgcatt cgatacagac 480 
ttcggcaaat taggaggatt gctttgttgg gaaaattata tgccactcgc gcgtcaagct 540 
atgtattccg ttggaactga agtatatgta gccccaacct gggactccag cgagaattgg 600 
ttgttaagca tgcgccatat tgcccgagag ggcggtatgt ttgtaattag tgtttgccag 660 
gctctccgaa aagacgacat ccctgatcag tatgaattta agaaactcta tcctgataat 720 
tcagaatgga tcaatagcgg taacagttgc atcatcaacc cgcgcggtga gattattgct 780 
ggaccaatct caaaccaaca agaaatactt tatgcagatt tagacctgag tttaattgca 840 
aaatctaaac gtatgttcga tgttactggg cattattccc ggccggatgt gtttagctat 900 
gaaatcaaca aaagctag 918 

<210> 286 
<211> 305 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 286 

Met Asn Thr Asn Leu Val Lys Val Ala Ala Ala Gln Val Ala Pro His 
1 5 10 15 

Phe Leu Asn Leu Ser Asn Thr Val Glu Lys Thr Cys Asn Leu Ile Ser 
20 25 30 

Glu Ala Gly Lys Asn Gly Ala Lys Leu Ile Val Phe Pro Glu Ala Phe 
35 40 45 

Ile Ser Gly Tyr Pro Asp Trp Val Trp Leu Ile Pro Asn Ala Asn Ser 
50 55 60 

Ala Met Leu Asp Asp Leu Tyr Gln Glu Leu Val Glu Asn Ala Val Thr 
65 70 75 80 

Ile Pro Asp Thr Thr Thr His Lys Leu Cys Gln Ala Ala Lys Asp Ala 
85 90 95 

Gly Val Tyr Val Ala Val Gly Ile His Glu Arg Asn Ser Glu Ala Ser 
100 105 110 

Gly Phe Thr Leu Phe Asn Thr Leu Leu Tyr Ile Asn Asp Gln Gly Val 
115 120 125 

Ile Ile Gly Lys His Arg Lys Leu Ile Pro Thr Gly Gly Glu Arg Leu 
130 135 140 

Val Trp Gly Gln Gly Asn Gly Asp Thr Leu Ser Ala Phe Asp Thr Asp 
145 150 155 160 

Phe Gly Lys Leu Gly Gly Leu Leu Cys Trp Glu Asn Tyr Met Pro Leu 
165 170 175 

Ala Arg Gln Ala Met Tyr Ser Val Gly Thr Glu Val Tyr Val Ala Pro 
180 185 190 

Thr Trp Asp Ser Ser Glu Asn Trp Leu Leu Ser Met Arg His Ile Ala 
195 200 205 

Arg Glu Gly Gly Met Phe Val Ile Ser Val Cys Gln Ala Leu Arg Lys 
210 215 220 

Asp Asp Ile Pro Asp Gln Tyr Glu Phe Lys Lys Leu Tyr Pro Asp Asn 
225 230 235 240 

Ser Glu Trp Ile Asn Ser Gly Asn Ser Cys Ile Ile Asn Pro Arg Gly 
245 250 255 

Glu Ile Ile Ala Gly Pro Ile Ser Asn Gln Gln Glu Ile Leu Tyr Ala 
260 265 270 

Asp Leu Asp Leu Ser Leu Ile Ala Lys Ser Lys Arg Met Phe Asp Val 
275 280 285 

Thr Gly His Tyr Ser Arg Pro Asp Val Phe Ser Tyr Glu Ile Asn Lys 
290 295 300 

Ser 
305 

<210> 287 
<211> 936 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 287 

gtgatcaagg tagcaatcgc ccaggtggca ccggtggttc tggacaaggc gcgcaccatt 60 
gagaaagcgg taggaattat tcgcgctgcc gcgcaagagg gcattgagct cctggttttc 120 
ccggagacgt ttatcccgac ctatccagcc tgggtatggc gcttgcgtcc gggtactgat 180 
tacggcctga gcgaggaact gcacgcgctc ctgctggata attcggtaga tatggagagc 240 
aaggacctgg agccattgca agctgttgct gcagagacca gcatgaccgt ggtaataggt 300 
atgaacgagc gagacggccg attcagccgg ggtacaatct acaatgccct ggttgtgatc 360 
ggtccaggtg gcacgatcct gaacaggcac cgcaagctta tgcccaccaa ccccgagcgt 420 
atggtttggg gtatgggcga tgccagcggg ctgaaggtag tggaaatgtc ttacgggcgc 480 
ctgggtgggc tgatttgctg ggagaatttc atgcctctcg cgcgctatgg cttgtatgcc 540 
cagggtgtgg agatttacgt ggcgcccacc tatgaccagg gcgacggctg ggtcggcagc 600 
atgcagcata tagcccggga gggtcgttgc tgggtactct cggccgggac ccttttgcgt 660 
ggcagtgatt ttctgccgga ttttccgggc aagaccgagt tatatcccga tgaccaggag 720 
tgggtgaatc cgggtggctc ggtgatcgtg gcaccggggg gagagattgt ggccggcccc 780 
atgtatcgcg acgaaggtct gctggtctgc gagttggatg cgacgcttag tgtccgcggc 840 
aagcgctcgc tggatgtggc cggccattac tcccggccgg atttgtttga actggaaata 900 
gatggcgacc cgctggaacc catagagtgg gattga 936 

<210> 288 
<211> 311 
<212> PRT 
<213> Unknown 
<220> 

<223> Obtained from an environmental sample 
<400> 288 

Val Ile Lys Val Ala Ile Ala Gln Val Ala Pro Val Val Leu Asp Lys 
1 5 10 15 

Ala Arg Thr Ile Glu Lys Ala Val Gly Ile Ile Arg Ala Ala Ala Gln 
20 25 30 

Glu Gly Ile Glu Leu Leu Val Phe Pro Glu Thr Phe Ile Pro Thr Tyr 
35 40 45 

Pro Ala Trp Val Trp Arg Leu Arg Pro Gly Thr Asp Tyr Gly Leu Ser 
50 55 60 

Glu Glu Leu His Ala Leu Leu Leu Asp Asn Ser Val Asp Met Glu Ser 
65 70 75 80 

Lys Asp Leu Glu Pro Leu Gln Ala Val Ala Ala Glu Thr Ser Met Thr 
85 90 95 

Val Val Ile Gly Met Asn Glu Arg Asp Gly Arg Phe Ser Arg Gly Thr 
100 105 110 

Ile Tyr Asn Ala Leu Val Val Ile Gly Pro Gly Gly Thr Ile Leu Asn 
115 120 125 

Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val Trp Gly 
130 135 140 

Met Gly Asp Ala Ser Gly Leu Lys Val Val Glu Met Ser Tyr Gly Arg 
145 150 155 160 

Leu Gly Gly Leu Ile Cys Trp Glu Asn Phe Met Pro Leu Ala Arg Tyr 
165 170 175 

Gly Leu Tyr Ala Gln Gly Val Glu Ile Tyr Val Ala Pro Thr Tyr Asp 
180 185 190 

Gln Gly Asp Gly Trp Val Gly Ser Met Gln His Ile Ala Arg Glu Gly 
195 200 205 

Arg Cys Trp Val Leu Ser Ala Gly Thr Leu Leu Arg Gly Ser Asp Phe 
210 215 220 

Leu Pro Asp Phe Pro Gly Lys Thr Glu Leu Tyr Pro Asp Asp Gln Glu 
225 230 235 240 

Trp Val Asn Pro Gly Gly Ser Val Ile Val Ala Pro Gly Gly Glu Ile 
245 250 255 

Val Ala Gly Pro Met Tyr Arg Asp Glu Gly Leu Leu Val Cys Glu Leu 
260 265 270 

Asp Ala Thr Leu Ser Val Arg Gly Lys Arg Ser Leu Asp Val Ala Gly 
275 280 285 

His Tyr Ser Arg Pro Asp Leu Phe Glu Leu Glu Ile Asp Gly Asp Pro 
290 295 300 

Leu Glu Pro Ile Glu Trp Asp 
305 310 

<210> 289 
<211> 921 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 289 

atggtgacgg tggccgccgt acaggcaacg ccggtgttcc tcgaccgcga ggcgacctcg 60 
gacaaggtct gcgccttggt caaggaggcg gccggccacg gggcagaact gatcgtcttc 120 
cccgagtcct tcgtccctgc ctatccggac tgggtgtggc gcacccctgc ctggagtgac 180 
accgagttcg tgaagcgctt ctacgcgaac gcggtgaccg tccccggcgc gaccctcgag 240 
cgcatcggcg cagcggcggc ggaggcggag gcgtacgtcg tgatcggcgt gaccgagatc 300 
gacggcggaa ctctctacaa cacccttctc tacctgggcc cggacggaca gctgttgcaa 360 
cggcatcgca agctcatgcc caccggtggg gagcggaccg tgtggggaat gggagacggc 420 
tctgagctcg acgtcgtgag cacgccgttc ggcgtcgtcg gtgggttgtt gtgctgggag 480 
aactacatgc cgctcgcccg ggcggcgatc tacgcccagc actgtgacat ctacctggct 540 
ccgacatggg acaacagcga cacgtgggta gccacgttgc gtcacatcgc caaggagggg 600 
cggcagttcg tcatcggcgt cgccccgctg ctgcgcggct ccgacgtacc ggaggacctc 660 
cgcggcacgc tctacgggct gtcggacgac tggatgtcgc gcggctacac caccatcgtc 720 
gcaccaagcg gcgaggtgat cgccggcccg gtcctggagc gtgaggagat cctcttcgcg 780 
gacctcgacc tggccgacgt gcaggagcag agaaggatgt tcgaccctgt cggccactac 840 
tcacgacccg acgtcttcac gctccacgtc gacgcacgac cgaagagccc ggtcgtcttc 900 
gagagggatg caccgacctg a 921 

<210> 290 

<211> 306 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 290 

Met Val Thr Val Ala Ala Val Gln Ala Thr Pro Val Phe Leu Asp Arg 
1 5 10 15 

Glu Ala Thr Ser Asp Lys Val Cys Ala Leu Val Lys Glu Ala Ala Gly 
20 25 30 

His Gly Ala Glu Leu Ile Val Phe Pro Glu Ser Phe Val Pro Ala Tyr 
35 40 45 

Pro Asp Trp Val Trp Arg Thr Pro Ala Trp Ser Asp Thr Glu Phe Val 
50 55 60 

Lys Arg Phe Tyr Ala Asn Ala Val Thr Val Pro Gly Ala Thr Leu Glu 
65 70 75 80 

Arg Ile Gly Ala Ala Ala Ala Glu Ala Glu Ala Tyr Val Val Ile Gly 
85 90 95 

Val Thr Glu Ile Asp Gly Gly Thr Leu Tyr Asn Thr Leu Leu Tyr Leu 
100 105 110 

Gly Pro Asp Gly Gln Leu Leu Gln Arg His Arg Lys Leu Met Pro Thr 
115 120 125 

Gly Gly Glu Arg Thr Val Trp Gly Met Gly Asp Gly Ser Glu Leu Asp 
130 135 140 

Val Val Ser Thr Pro Phe Gly Val Val Gly Gly Leu Leu Cys Trp Glu 
145 150 155 160 

Asn Tyr Met Pro Leu Ala Arg Ala Ala Ile Tyr Ala Gln His Cys Asp 
165 170 175 

Ile Tyr Leu Ala Pro Thr Trp Asp Asn Ser Asp Thr Trp Val Ala Thr 
180 185 190 

Leu Arg His Ile Ala Lys Glu Gly Arg Gln Phe Val Ile Gly Val Ala 
195 200 205 

Pro Leu Leu Arg Gly Ser Asp Val Pro Glu Asp Leu Arg Gly Thr Leu 
210 215 220 

Tyr Gly Leu Ser Asp Asp Trp Met Ser Arg Gly Tyr Thr Thr Ile Val 
225 230 235 240 

Ala Pro Ser Gly Glu Val Ile Ala Gly Pro Val Leu Glu Arg Glu Glu 
245 250 255 

Ile Leu Phe Ala Asp Leu Asp Leu Ala Asp Val Gln Glu Gln Arg Arg 
260 265 270 

Met Phe Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val Phe Thr Leu 
275 280 285 

His Val Asp Ala Arg Pro Lys Ser Pro Val Val Phe Glu Arg Asp Ala 
290 295 300 

Pro Thr 
305 

<210> 291 
<211> 1002 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 291 

atgatgaaaa caactgttac cgttgcctgc gttcaggccg cccccgtatt tatggattta 60 
gaaggcacca tagataaaac gatcaccctc atctctgaag ccgcacagaa aggcgcggag 120 
ctcatcgctt ttccggagac ctggataccc ggttacccgt ggttcttatg gctgaactcg 180 
cccgcgacaa atatgcccct ggtttatcaa tatcatcaga actctctggt gctggacagt 240 
gcccaggcga agcgaattgc ggatgctgca cagcagaata acatcactgt cgttctggga 300 
ttcagcgagc gcgatcatgg aagcctctat atctcacagt ggctgattgg cagcgacggg 360 
gaaactattg gcatccggcg caagctcaag gccacacacg tggagcgtac gctgttcggc 420 
gaaagcgacg gctcctccct gaccacctgg gagacacctc tgggtaacgt cggggccctc 480 
tgctgctggg agcacctgca gccgctgtct cgctatgcga tgtattccca gcatgaagag 540 
atccatatcg ctgcgtggcc cagcttcagt ctttacacca gcgcaactgc cgcgctcgga 600 
cctgacgtca acacggcggc ttcacgcctc tatgccgcgg aggggcagtg cttcgtgtta 660 
gccccatgtg ccgtggtttc tgatgagatg attgatttac tctgtcctga tgatgaccgg 720 
agagcgttac tcagtgccgg agggggacat gcccgtattt acggacctga tggaagagaa 780 
ctcgtcaccc ctctcgggga aaatgaggaa ggactgctta tcgctgagct cgactctgct 840 
gcgatcacct ttgccaaact ggcggcagat cccgtaggcc actattcccg ccctgacgtg 900 
acccgccttc tttttaatcc ttcagccaac aagactgtta ttaaacggca ttcgcctcct 960 
gagttaattg ccgaacaggc tccggaagaa gaggaggagt ag 1002 



<210> 292 
<211> 333 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 292 
Met Met Lys Thr Thr Val Thr Val Ala Cys Val Gln Ala Ala Pro Val 
1 5 10 15 

Phe Met Asp Leu Glu Gly Thr Ile Asp Lys Thr Ile Thr Leu Ile Ser 
20 25 30 

Glu Ala Ala Gln Lys Gly Ala Glu Leu Ile Ala Phe Pro Glu Thr Trp 
35 40 45 

Ile Pro Gly Tyr Pro Trp Phe Leu Trp Leu Asn Ser Pro Ala Thr Asn 
50 55 60 

Met Pro Leu Val Tyr Gln Tyr His Gln Asn Ser Leu Val Leu Asp Ser 
65 70 75 80 

Ala Gln Ala Lys Arg Ile Ala Asp Ala Ala Gln Gln Asn Asn Ile Thr 
85 90 95 

Val Val Leu Gly Phe Ser Glu Arg Asp His Gly Ser Leu Tyr Ile Ser 
100 105 110 

Gln Trp Leu Ile Gly Ser Asp Gly Glu Thr Ile Gly Ile Arg Arg Lys 
115 120 125 

Leu Lys Ala Thr His Val Glu Arg Thr Leu Phe Gly Glu Ser Asp Gly 
130 135 140 

Ser Ser Leu Thr Thr Trp Glu Thr Pro Leu Gly Asn Val Gly Ala Leu 
145 150 155 160 

Cys Cys Trp Glu His Leu Gln Pro Leu Ser Arg Tyr Ala Met Tyr Ser 
165 170 175 

Gln His Glu Glu Ile His Ile Ala Ala Trp Pro Ser Phe Ser Leu Tyr 
180 185 190 

Thr Ser Ala Thr Ala Ala Leu Gly Pro Asp Val Asn Thr Ala Ala Ser 
195 200 205 

Arg Leu Tyr Ala Ala Glu Gly Gln Cys Phe Val Leu Ala Pro Cys Ala 
210 215 220 

Val Val Ser Asp Glu Met Ile Asp Leu Leu Cys Pro Asp Asp Asp Arg 
225 230 235 240 

Arg Ala Leu Leu Ser Ala Gly Gly Gly His Ala Arg Ile Tyr Gly Pro 
245 250 255 

Asp Gly Arg Glu Leu Val Thr Pro Leu Gly Glu Asn Glu Glu Gly Leu 
260 265 270 

Leu Ile Ala Glu Leu Asp Ser Ala Ala Ile Thr Phe Ala Lys Leu Ala 
275 280 285 

Ala Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val Thr Arg Leu Leu 
290 295 300 

Phe Asn Pro Ser Ala Asn Lys Thr Val Ile Lys Arg His Ser Pro Pro 
305 310 315 320 

Glu Leu Ile Ala Glu Gln Ala Pro Glu Glu Glu Glu Glu 
325 330 









<210> 293 
<211> 1008 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 293
atgaaaaata tcaaaaactc agaaaaaagc agcacagtaa gagtcgctgc ggtacaaatc 60
agtccggtgt tgtacaaccg cgaagctacc gttcaaaaag tagtcaacaa aatccttgaa 120
ctaggaaaac aaggggtaca attcgccact tttccggaaa cgatagtgcc ttattatcct 180
tatttctctt ttattcaggc gccttatgcc atgggcaaag aacacctgcg cttgcttgaa 240
caatcagtta ctgttccgtc agccgcgacc gatgccataa gtgaggcggc aaaggaagcc 300
aatatggtag tgtctattgg tgtcaatgaa cgagacggtg gtaccattta caatacgcaa 360
ctcctttttg atgctgacgg aacattaatt cagcgcagac gtaaacttac accaacgtat 420
catgaaagaa tgatttgggg acaaggtgac gcttcaggtc ttcgtgccac agacagcgct 480
gttgggcgta tcgggcagtt ggcttgttgg gaacattaca atccattgtt ccgttatgct 540
ttgattgctg atggagaaca aatccattct gccatgtatc ccggatcatt tttaggtgcg 600
ttgcacggtg aacaaaccga aatcaatgta cgccaacacg ctttagaatc ggccagcttc 660
gtcgtagtgg ctaccggttg gttggatgcc gatcaacaag cacaaattgc gaaagacacc 720
ggtggaccaa tcggaccaat ttcgggaggt tgttttacag ccgttatagg ccctgacgga 780
caactaatcg gggaagccct tacatcaggt gaaggggaag tgattgccga tattgatttg 840
gcacaaattg atgcccgcaa aagattaatg gatgccagtg gtcactacaa ccgtcctgaa 900
ttgttgagct tgcatatcga tcacactccg actgctccta tgcatgaaag agtagtttac 960
actgagccgg gattagcaaa aagacaaaat gaaaattcat caaattaa 1008

<210> 294
<211> 335
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 294

Met Lys Asn Ile Lys Asn Ser Glu Lys Ser Ser Thr Val Arg Val Ala
1 5 10 15

Ala Val Gln Ile Ser Pro Val Leu Tyr Asn Arg Glu Ala Thr Val Gln
20 25 30

Lys Val Val Asn Lys Ile Leu Glu Leu Gly Lys Gln Gly Val Gln Phe
35 40 45

Ala Thr Phe Pro Glu Thr Ile Val Pro Tyr Tyr Pro Tyr Phe Ser Phe
50 55 60

Ile Gln Ala Pro Tyr Ala Met Gly Lys Glu His Leu Arg Leu Leu Glu
65 70 75 80

Gln Ser Val Thr Val Pro Ser Ala Ala Thr Asp Ala Ile Ser Glu Ala
85 90 95

Ala Lys Glu Ala Asn Met Val Val Ser Ile Gly Val Asn Glu Arg Asp
100 105 110

Gly Gly Thr Ile Tyr Asn Thr Gln Leu Leu Phe Asp Ala Asp Gly Thr
115 120 125

Leu Ile Gln Arg Arg Arg Lys Leu Thr Pro Thr Tyr His Glu Arg Met
130 135 140

Ile Trp Gly Gln Gly Asp Ala Ser Gly Leu Arg Ala Thr Asp Ser Ala
145 150 155 160

Val Gly Arg Ile Gly Gln Leu Ala Cys Trp Glu His Tyr Asn Pro Leu
165 170 175

Phe Arg Tyr Ala Leu Ile Ala Asp Gly Glu Gln Ile His Ser Ala Met
180 185 190

Tyr Pro Gly Ser Phe Leu Gly Ala Leu His Gly Glu Gln Thr Glu Ile
195 200 205

Asn Val Arg Gln His Ala Leu Glu Ser Ala Ser Phe Val Val Val Ala
210 215 220

Thr Gly Trp Leu Asp Ala Asp Gln Gln Ala Gln Ile Ala Lys Asp Thr
225 230 235 240

Gly Gly Pro Ile Gly Pro Ile Ser Gly Gly Cys Phe Thr Ala Val Ile
245 250 255

Gly Pro Asp Gly Gln Leu Ile Gly Glu Ala Leu Thr Ser Gly Glu Gly
260 265 270

Glu Val Ile Ala Asp Ile Asp Leu Ala Gln Ile Asp Ala Arg Lys Arg
275 280 285

Leu Met Asp Ala Ser Gly His Tyr Asn Arg Pro Glu Leu Leu Ser Leu
290 295 300

His Ile Asp His Thr Pro Thr Ala Pro Met His Glu Arg Val Val Tyr
305 310 315 320

Thr Glu Pro Gly Leu Ala Lys Arg Gln Asn Glu Asn Ser Ser Asn
325 330 335



<210> 295 
<211> 1134 
<212> DNA
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 295

atggccgcaa aagtacttgg aggacgtgac acagtaaaag tagcagttgt tcaaacacca 60
tcagtcttta tggacaagaa agcctgtctc gaacttgcct gcgataagat tatcgaagcc 120
ggcaaagagg gcgcggagct tgttgttttt cctgaaacct ggattccgac atatccttat 180
tggaccatgg gatgggatac catcgcccat ggcttccatg atgtgatggc ggacctgcag 240
gacaattccg tggtcgtcgg gagcgaagac accgacatat tgggcaaagc cgcccgggaa 300
gctggcgcct acgtcgtcat gggctgtaat gagctcgatg accgggtcgg cagcaggacc 360
ttgttcaact cgctggtcta tatggacaaa tatggcggcg tgctcggccg tcaccgtaaa 420
ttaatcccgt cctttatcga acgcatctgg tggggcaatg gggacagccg cgatctcaaa 480
gtttacgaca cggaaattgg gcgcatcggc ggtcaaatct gctgggaaaa tcacattgtg 540
aacatcaccg catggtacat tgcccaaggc gtggatattc atgtctcggt ttggccggga 600
atgtggaact gtggcgcaga agaaggagaa tccttcatat acgccggtca cgacatcaac 660
aaatgtgacc tcatccctgc tacacgcgga cgggcattta cgggtcagtg ttatgtcctc 720
tcagccaaca acctactgcg gatggaagac attcctgacg atttcccgtt ccgggatacc 780
atgaactatg gcggtccggg ccaggaggat tttgtcggat gggcttgcgg tggcagccat 840
attgttgcgc caacgtctga atttatggtg ccgccgacgt ttgatataga caccatcatc 900
tatgcagaac ttcaggcgaa atacatcaaa gtggtgaagt cggtcttcga ttccctcggc 960
cactacgcgc ggtgggacct cgtcaacttg accacgccgc caccgccgta tgaacctgaa 1020
accgacgcac cagctctcac ggccgatatc cgtgatcggg tcatcgagag tgtggctaaa 1080 
gagttcaagc tcgaaccaga aaaagtggct gaagttgtgc gcaatgccgc ctag 1134 

<210> 296 
<211> 377 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 296

Met Ala Ala Lys Val Leu Gly Gly Arg Asp Thr Val Lys Val Ala Val
1 5 10 15

Val Gln Thr Pro Ser Val Phe Met Asp Lys Lys Ala Cys Leu Glu Leu
20 25 30

Ala Cys Asp Lys Ile Ile Glu Ala Gly Lys Glu Gly Ala Glu Leu Val
35 40 45

Val Phe Pro Glu Thr Trp Ile Pro Thr Tyr Pro Tyr Trp Thr Met Gly
50 55 60

Trp Asp Thr Ile Ala His Gly Phe His Asp Val Met Ala Asp Leu Gln
65 70 75 80

Asp Asn Ser Val Val Val Gly Ser Glu Asp Thr Asp Ile Leu Gly Lys
85 90 95

Ala Ala Arg Glu Ala Gly Ala Tyr Val Val Met Gly Cys Asn Glu Leu
100 105 110

Asp Asp Arg Val Gly Ser Arg Thr Leu Phe Asn Ser Leu Val Tyr Met
115 120 125

Asp Lys Tyr Gly Gly Val Leu Gly Arg His Arg Lys Leu Ile Pro Ser
130 135 140

Phe Ile Glu Arg Ile Trp Trp Gly Asn Gly Asp Ser Arg Asp Leu Lys
145 150 155 160

Val Tyr Asp Thr Glu Ile Gly Arg Ile Gly Gly Gln Ile Cys Trp Glu
165 170 175

Asn His Ile Val Asn Ile Thr Ala Trp Tyr Ile Ala Gln Gly Val Asp
180 185 190

Ile His Val Ser Val Trp Pro Gly Met Trp Asn Cys Gly Ala Glu Glu
195 200 205

Gly Glu Ser Phe Ile Tyr Ala Gly His Asp Ile Asn Lys Cys Asp Leu
210 215 220

Ile Pro Ala Thr Arg Gly Arg Ala Phe Thr Gly Gln Cys Tyr Val Leu
225 230 235 240

Ser Ala Asn Asn Leu Leu Arg Met Glu Asp Ile Pro Asp Asp Phe Pro
245 250 255

Phe Arg Asp Thr Met Asn Tyr Gly Gly Pro Gly Gln Glu Asp Phe Val
260 265 270

Gly Trp Ala Cys Gly Gly Ser His Ile Val Ala Pro Thr Ser Glu Phe
275 280 285

Met Val Pro Pro Thr Phe Asp Ile Asp Thr Ile Ile Tyr Ala Glu Leu
290 295 300

Gln Ala Lys Tyr Ile Lys Val Val Lys Ser Val Phe Asp Ser Leu Gly
305 310 315 320

His Tyr Ala Arg Trp Asp Leu Val Asn Leu Thr Thr Pro Pro Pro Pro
325 330 335

Tyr Glu Pro Glu Thr Asp Ala Pro Ala Leu Thr Ala Asp Ile Arg Asp
340 345 350

Arg Val Ile Glu Ser Val Ala Lys Glu Phe Lys Leu Glu Pro Glu Lys
355 360 365

Val Ala Glu Val Val Arg Asn Ala Ala
370 375

<210> 297
<211> 1059 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 297

atgacacgtt ttcgggacgt cacggtggcg gcggttcagg ccgcacccgt ctatttcgat 60
cgggaggcct ccacagataa ggcgtgccaa ttgattcacg aagcggcgaa gaaaggcgca 120
gccctcgcgg cgttcggcga aacgtggttg ccgggatatc cgttctttgc atgggggttc 180
gcgcacaacc ggagcctgtt ctggaatgcg gccgccgagt acatcgccaa tgctgtggag 240
attccgagtc caacgacgga ccgcctctgc gccgcagcga agatcgccgg gattgacgtg 300
gtaatcggcg tcgtagaact ggatggacga acgcgagcgt cggtttacag cacactgctg 360
ttcatcggga gagagggggc gatcctgggg cgccaccgca aattgaagcc aacccacatg 420
gagcgaacgg tgtggggtga aggggacgct cacgggctcc gcgttcacga gcgtccgtac 480
ggccgcctca gcgggctgaa ttgctgggaa cacaacatga tgctgcccgg ctatgtgctt 540
gccgcgcagg gcacgcagtt tcacgtcgcg acatggcctg ggaaagagag gctcacagtt 600
ccgccgaacg aggcggctta tacgcgccag cttctcctct ctcgcgccta tgcatcccag 660
gcgggcgcgt acgtgatcag cgtcgcgggg ctgctcggac ccgactcgat gccggagcgt 720
tatcgcgaac tgggacagtc ctatgagttg accggcgaca gcgtcatcat cgatccgcgc 780
ggcgaggtca tcgcggggcc cgcgaagggc gagaccatcc tgctcgcgca atgcagccag 840
gaagccctct tcgccgcaaa gtccgccatt gacgtcggcg gtcattactc gcgcccggat 900
atttttcagc tgcgtgtcaa cgatcagcta cagcatcagg tccggagact cgaggcgact 960
ctcacgcccc cagtcgccgt agttgtcggg cctgagggta gctcacatga gcaggagacg 1020
gccttcgggc catccagcct cctggccacg acaagctag 1059



<210> 298 
<211> 352 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
<400> 298

Met Thr Arg Phe Arg Asp Val Thr Val Ala Ala Val Gln Ala Ala Pro
1 5 10 15

Val Tyr Phe Asp Arg Glu Ala Ser Thr Asp Lys Ala Cys Gln Leu Ile
20 25 30

His Glu Ala Ala Lys Lys Gly Ala Ala Leu Ala Ala Phe Gly Glu Thr
35 40 45

Trp Leu Pro Gly Tyr Pro Phe Phe Ala Trp Gly Phe Ala His Asn Arg
50 55 60

Ser Leu Phe Trp Asn Ala Ala Ala Glu Tyr Ile Ala Asn Ala Val Glu
65 70 75 80

Ile Pro Ser Pro Thr Thr Asp Arg Leu Cys Ala Ala Ala Lys Ile Ala
85 90 95

Gly Ile Asp Val Val Ile Gly Val Val Glu Leu Asp Gly Arg Thr Arg
100 105 110

Ala Ser Val Tyr Ser Thr Leu Leu Phe Ile Gly Arg Glu Gly Ala Ile
115 120 125

Leu Gly Arg His Arg Lys Leu Lys Pro Thr His Met Glu Arg Thr Val
130 135 140

Trp Gly Glu Gly Asp Ala His Gly Leu Arg Val His Glu Arg Pro Tyr
145 150 155 160

Gly Arg Leu Ser Gly Leu Asn Cys Trp Glu His Asn Met Met Leu Pro
165 170 175

Gly Tyr Val Leu Ala Ala Gln Gly Thr Gln Phe His Val Ala Thr Trp
180 185 190

Pro Gly Lys Glu Arg Leu Thr Val Pro Pro Asn Glu Ala Ala Tyr Thr
195 200 205

Arg Gln Leu Leu Leu Ser Arg Ala Tyr Ala Ser Gln Ala Gly Ala Tyr
210 215 220

Val Ile Ser Val Ala Gly Leu Leu Gly Pro Asp Ser Met Pro Glu Arg
225 230 235 240

Tyr Arg Glu Leu Gly Gln Ser Tyr Glu Leu Thr Gly Asp Ser Val Ile
245 250 255

Ile Asp Pro Arg Gly Glu Val Ile Ala Gly Pro Ala Lys Gly Glu Thr
260 265 270

Ile Leu Leu Ala Gln Cys Ser Gln Glu Ala Leu Phe Ala Ala Lys Ser
275 280 285

Ala Ile Asp Val Gly Gly His Tyr Ser Arg Pro Asp Ile Phe Gln Leu
290 295 300

Arg Val Asn Asp Gln Leu Gln His Gln Val Arg Arg Leu Glu Ala Thr
305 310 315 320

Leu Thr Pro Pro Val Ala Val Val Val Gly Pro Glu Gly Ser Ser His
325 330 335

Glu Gln Glu Thr Ala Phe Gly Pro Ser Ser Leu Leu Ala Thr Thr Ser
340 345 350



<210> 299 
<211> 987 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 299

atgactgttg ttaaggccgc cgcagtgcag atcagcccgg tgctctacag ccgtgcagga 60
accgtcgaga aggtcgtgaa gaagatcgac gagctgggcc agaagggtgt cgagtttgcc 120
gtcttccctg aaaccgttgt cccctactac ccctacttct ccttcgtgca gcccccctac 180
aaactggcca cggagcacct gcgcctgctt gaggagtcgg tgaccgtgcc ctctgccgag 240
acggacgcca tcggcgacgc cgcccgcaag gccaacatgg tcgtctcgat cggtgtcaac 300
gaacgtgatg gcggcaccat ttacaacacc caactcctgt tcgacgccga cggaaccctg 360
atccagcgcc gccgcaagat cacgccgacc taccacgagc ggatgatctg gggacaggga 420
gacggatcag gcttgcgcgc ggtcgacagc gtcgtcggcc gcatcggcca gctcgcctgc 480
tgggagcact accagccgct ggcccgttac gctctcatcg ccgacggcga gcagatccac 540
gccgcgatgt accccggcgc cttcgggggc gatctgttcg cagagcagat cgaagtcaat 600
gtccgtcagc acgctctgga atcggccagt ttcgtcgtca gcgccaccgc ctggctcgac 660
gccgaccagc aggcccagat tgcgaaggac accggcggcc ccgtacaggc gatctccggc 720
ggcttcttca cagccatcat cgaccccgac ggccgcatca tcggcgaacc gatcacctcc 780
ggcgaaggcg aagtcatcgc tgacctcgac tttgcgctca tcgaccgccg caagcgcctg 840
atggacgcca gcggccacta cagccgcccc gaactgctca gcctgcagat cgaccggacg 900
ccggcacccg ccgtccacga tcgcaatcgc caggggtcct caagcgctcc ggcaactgaa 960
aagggccgct cagccgaggc caagtga 987



<210> 300 
<211> 328 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 300

Met Thr Val Val Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Ala Gly Thr Val Glu Lys Val Val Lys Lys Ile Asp Glu Leu
20 25 30

Gly Gln Lys Gly Val Glu Phe Ala Val Phe Pro Glu Thr Val Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Pro Pro Tyr Lys Leu Ala Thr
50 55 60

Glu His Leu Arg Leu Leu Glu Glu Ser Val Thr Val Pro Ser Ala Glu
65 70 75 80

Thr Asp Ala Ile Gly Asp Ala Ala Arg Lys Ala Asn Met Val Val Ser
85 90 95

Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Thr Gln Leu
100 105 110

Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys Ile Thr
115 120 125

Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp Gly Ser Gly
130 135 140

Leu Arg Ala Val Asp Ser Val Val Gly Arg Ile Gly Gln Leu Ala Cys
145 150 155 160

Trp Glu His Tyr Gln Pro Leu Ala Arg Tyr Ala Leu Ile Ala Asp Gly
165 170 175

Glu Gln Ile His Ala Ala Met Tyr Pro Gly Ala Phe Gly Gly Asp Leu
180 185 190

Phe Ala Glu Gln Ile Glu Val Asn Val Arg Gln His Ala Leu Glu Ser
195 200 205

Ala Ser Phe Val Val Ser Ala Thr Ala Trp Leu Asp Ala Asp Gln Gln
210 215 220

Ala Gln Ile Ala Lys Asp Thr Gly Gly Pro Val Gln Ala Ile Ser Gly
225 230 235 240

Gly Phe Phe Thr Ala Ile Ile Asp Pro Asp Gly Arg Ile Ile Gly Glu
245 250 255

Pro Ile Thr Ser Gly Glu Gly Glu Val Ile Ala Asp Leu Asp Phe Ala
260 265 270

Leu Ile Asp Arg Arg Lys Arg Leu Met Asp Ala Ser Gly His Tyr Ser
275 280 285

Arg Pro Glu Leu Leu Ser Leu Gln Ile Asp Arg Thr Pro Ala Pro Ala
290 295 300

Val His Asp Arg Asn Arg Gln Gly Ser Ser Ser Ala Pro Ala Thr Glu
305 310 315 320

Lys Gly Arg Ser Ala Glu Ala Lys
325



<210> 301 
<211> 1032 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 301 

atgaccacgc ccgcgccgtt caccgccgcc gtcgtccagg cgtcgcccgt gttcctcgac 60 

cgcgatgcca cggtcgataa ggcgtgcgcc ctgatcaccg cggccggagc gcgcggcgcg 120 

aagctcgtcg tgctgcccga gaccttcgtg cccgcatacc ccgcgtgggt ctggtacctg 180 

ccgctcacgc gtcggcccga cgtcgcgaag ctctaccgcg cgctcgtcga gaacgcgatc 240

gacgtgccgg gtccggaaac cgagcgcctc gggcgcgcgg cgcgtgacgc gggcgcgtgg 300 

gtggcgatcg gtgtcaacga gcgcaacgcg aaggcgagcc gcacgtcgct ctacaacacc 360 

gtgctcctgt tcgacgacca gggcacgctg gtcgagtcgc ggcgcaagct gatgccgacc 420 

ggcggcgagc gcctggtgtg gacgccgggc gagccggtgc cgttgcgggt tcacgacacg 480

ccgctcggcc gcgtcggcgc gctgatctgc tgggagaact acatgccgct cgcgcgcttc 540

gcgctctacg agcagggcgt ccagatctat ctcgcgccga cctgggacta cagcgaggcg 600 

tggctcgcgt ccatgcggca cgtcgcgcgt gaaggtcgca cgtgggtgat cgggtgcagc 660 

caggcggtgc ggcgtgacga gatcccggac cacctgccgt tcaaggacgc gatccccgag 720 
agcctcgagt ggatcaacgc cgggaacagc atcgtcgtcg atccggacgg cgcggtggtc 780



gcgggaccgc tggcgcgggc gcacgacacg ctctacgtcg agatcgatcc gggccgcgcc 840 

gccggctccc gatggatctt cgacgccgcc ggtcactatc accgtcccga cctgttccat 900 

ttcgcgatgc gcgcgccggt ggcgccagag accgcgccgg ttgtgcggaa gcgcggccgg 960 

cggctagcct cggccccggc gcggaagcca cggaaacggc cggcgcgtcc cacccgaagg 1020 

aggactcgat ga 1032 

<210> 302 
<211> 335 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 302 

Met Thr Thr Pro Ala Pro Phe Thr Ala Ala Val Val Gln Ala Ser Pro
1 5 10 15

Val Phe Leu Asp Arg Asp Ala Thr Val Asp Lys Ala Cys Ala Leu Ile
20 25 30

Thr Ala Ala Gly Ala Arg Gly Ala Lys Leu Val Val Leu Pro Glu Thr
35 40 45

Phe Val Pro Ala Tyr Pro Ala Trp Val Trp Tyr Leu Pro Leu Thr Arg
50 55 60

Arg Pro Asp Val Ala Lys Leu Tyr Arg Ala Leu Val Glu Asn Ala Ile
65 70 75 80

Asp Val Pro Gly Pro Glu Thr Glu Arg Leu Gly Arg Ala Ala Arg Asp
85 90 95

Ala Gly Ala Trp Val Ala Ile Gly Val Asn Glu Arg Asn Ala Lys Ala
100 105 110

Ser Arg Thr Ser Leu Tyr Asn Thr Val Leu Leu Phe Asp Asp Gln Gly
115 120 125

Thr Leu Val Glu Ser Arg Arg Lys Leu Met Pro Thr Gly Gly Glu Arg
130 135 140

Leu Val Trp Thr Pro Gly Glu Pro Val Pro Leu Arg Val His Asp Thr
145 150 155 160

Pro Leu Gly Arg Val Gly Ala Leu Ile Cys Trp Glu Asn Tyr Met Pro
165 170 175

Leu Ala Arg Phe Ala Leu Tyr Glu Gln Gly Val Gln Ile Tyr Leu Ala
180 185 190

Pro Thr Trp Asp Tyr Ser Glu Ala Trp Leu Ala Ser Met Arg His Val
195 200 205

Ala Arg Glu Gly Arg Thr Trp Val Ile Gly Cys Ser Gln Ala Val Arg
210 215 220

Arg Asp Glu Ile Pro Asp His Leu Pro Phe Lys Asp Ala Ile Pro Glu
225 230 235 240

Ser Leu Glu Trp Ile Asn Ala Gly Asn Ser Ile Val Val Asp Pro Asp
245 250 255

Gly Ala Val Val Ala Gly Pro Leu Ala Arg Ala His Asp Thr Leu Tyr
260 265 270

Val Glu Ile Asp Pro Gly Arg Ala Ala Gly Ser Arg Trp Ile Phe Asp
275 280 285

Ala Ala Gly His Tyr His Arg Pro Asp Leu Phe His Phe Ala Met Arg
290 295 300

Ala Pro Val Ala Pro Glu Thr Ala Pro Val Val Arg Lys Arg Gly Arg
305 310 315 320

Arg Leu Ala Ser Ala Pro Ala Arg Lys Pro Arg Lys Arg Pro Ala
325 330 335

<210> 303 
<211> 1011 
<212> DNA
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 303

atgggcatcg ttcaccccaa gtacaaagtc gctgtcgttc aggctgcccc tgtatggcta 60
gacctggagg ccactgttga taaatgcatt cagttgattg aagaagcagc cagcaagggc 120
tgcaagctca tcgccttccc tgagaccttc attcccggat acccttggta tatctggatg 180
ggaacgcctg cctggactat tcagcgcggc tttgtacagc gctattttga taattctctg 240
tcttatgaca gtccgcaagc ggagaagcta aggcaggcag tcaaaaaggc tgagatcacg 300
gccgtactag gtctttctga gcgcagcggc ggtagcttgt acattgcaca atggaccatt 360
ggccctgacg gagaaaccat acacaaacgc agaaaagtgc gtccaacgca tggtgagcgt 420
acggtatttg gcgacggtga cggtagtgat cttgcggtgc acgatacccc cctggggcgg 480
ctgggcgcgc ttgcgtgctg ggagaacata ctgtcactga acaagtatgc gatgtattca 540
cagaatgagc aggtgcacgt agccgcttgg cctagcttct cggtctacga gcctttcgcc 600
catgcattgg gttgggaggt caataacgca gtcagcaagg tttacgcggt agaaggcggc 660
tgttttgtat tggcgccttg cgcagtggtc tccgaggaaa tgatcgaagc actgtgcgat 720
acacccgata agcaccaact ggctcatgcg ggtggagggc atgctgtcat ttacggacca 780
gatggcagtc ctctggcaga taagttaccc gaaggagagg aggggctatt aattgcagaa 840
attgatctcg gtctcatcag cttggcgaaa aatgccatgg acccggtggg gcattactct 900
cgacctgacg tacatcgctt gctattgaat cgcaatccag caaagcgggt tgaggaattt 960
tctctgccca ttgatttggc agagacaact ccgccaatat taggcacgta g 1011



<210> 304 
<211> 336 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 304

Met Gly Ile Val His Pro Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15

Pro Val Trp Leu Asp Leu Glu Ala Thr Val Asp Lys Cys Ile Gln Leu
20 25 30

Ile Glu Glu Ala Ala Ser Lys Gly Cys Lys Leu Ile Ala Phe Pro Glu
35 40 45

Thr Phe Ile Pro Gly Tyr Pro Trp Tyr Ile Trp Met Gly Thr Pro Ala
50 55 60

Trp Thr Ile Gln Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80

Ser Tyr Asp Ser Pro Gln Ala Glu Lys Leu Arg Gln Ala Val Lys Lys
85 90 95

Ala Glu Ile Thr Ala Val Leu Gly Leu Ser Glu Arg Ser Gly Gly Ser
100 105 110

Leu Tyr Ile Ala Gln Trp Thr Ile Gly Pro Asp Gly Glu Thr Ile His
115 120 125

Lys Arg Arg Lys Val Arg Pro Thr His Gly Glu Arg Thr Val Phe Gly
130 135 140

Asp Gly Asp Gly Ser Asp Leu Ala Val His Asp Thr Pro Leu Gly Arg
145 150 155 160

Leu Gly Ala Leu Ala Cys Trp Glu Asn Ile Leu Ser Leu Asn Lys Tyr
165 170 175

Ala Met Tyr Ser Gln Asn Glu Gln Val His Val Ala Ala Trp Pro Ser
180 185 190

Phe Ser Val Tyr Glu Pro Phe Ala His Ala Leu Gly Trp Glu Val Asn
195 200 205

Asn Ala Val Ser Lys Val Tyr Ala Val Glu Gly Gly Cys Phe Val Leu
210 215 220

Ala Pro Cys Ala Val Val Ser Glu Glu Met Ile Glu Ala Leu Cys Asp
225 230 235 240

Thr Pro Asp Lys His Gln Leu Ala His Ala Gly Gly Gly His Ala Val
245 250 255

Ile Tyr Gly Pro Asp Gly Ser Pro Leu Ala Asp Lys Leu Pro Glu Gly
260 265 270

Glu Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Leu Ile Ser Leu
275 280 285

Ala Lys Asn Ala Met Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val
290 295 300

His Arg Leu Leu Leu Asn Arg Asn Pro Ala Lys Arg Val Glu Glu Phe
305 310 315 320

Ser Leu Pro Ile Asp Leu Ala Glu Thr Thr Pro Pro Ile Leu Gly Thr
325 330 335



<210> 305 
<211> 1068 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 305

atgacgatgc aacatcccaa gtttcgcgct gctgccgtgc aggcggcacc ggtcttcctt 60
gaccttgatg cgtcgataga taaagccatc gacctgatcg cgcaagccgc caaaggcggc 120
gcgcaattga ttgcctttcc ggaaacctgg ctgcccggct accccttttt catctggctc 180
gattcgccgg cctggggcat gcaattcatc cagcgctacc atgacaattc cctggtctac 240
ggcacaccgc aggccgagcg catcgcgcag gctgcgaaaa agcatcgcat catggtcgtc 300
atggggcaca gcgagcggga tcatggaagt ctgtacatcg ctcagtggat catcggtgcc 360
gatggggaaa cggttgcgac acgtcgtaag ctcaaaccga ctcatgccga gcgcaccttg 420
tttggggaag gcgatggcag tgacctgagc gtgttcgata caccgctggg aagggttggc 480
gcactatgct gctgggagca cctccagccc ctgtcgaaat acgcgctata cgcacagaat 540
gagcaggtcc acattgcttc ctggccaagc ttttccctgt atcgcggggg cgcctacgcg 600
ctcggcgccg aagtgaacaa tgcggccagc cagatttatg ctgtcgaagg ccagtgtttt 660
gtgatcgcgc cgtgcggggt cgtcacgaaa gaaatgctgg acgtgctgtg caccgacgaa 720
atgaaaaagc agttgttggt tgaaggcggc gggttcgcgc gaatttacgc gcccgatgga 780
cagatgatgc acgcgccgct ggcggaaaac gaagagggcc tggtgtatgc cgatctcgac 840
ctgggcatga tctcgctggc caaagtagtc gccgacccgg ccgggcatta tgcgcggccc 900
gacgtaaccc ggttactgct ggacaagact ccgggagacc gcgtcatgct ggcgagccgt 960
cgcggcaagg aggtcagccg cgctggtaac gacgagccgc aagtgctggt ctcgcgcaac 1020
gacacgctga cctcgccgaa gccagcgtct tcgcgcaaag caagctga 1068



<210> 306 
<211> 355 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 306 

Met Thr Met Gln His Pro Lys Phe Arg Ala Ala Ala Val Gln Ala Ala
1 5 10 15

Pro Val Phe Leu Asp Leu Asp Ala Ser Ile Asp Lys Ala Ile Asp Leu
20 25 30

Ile Ala Gln Ala Ala Lys Gly Gly Ala Gln Leu Ile Ala Phe Pro Glu
35 40 45

Thr Trp Leu Pro Gly Tyr Pro Phe Phe Ile Trp Leu Asp Ser Pro Ala
50 55 60

Trp Gly Met Gln Phe Ile Gln Arg Tyr His Asp Asn Ser Leu Val Tyr
65 70 75 80

Gly Thr Pro Gln Ala Glu Arg Ile Ala Gln Ala Ala Lys Lys His Arg
85 90 95

Ile Met Val Val Met Gly His Ser Glu Arg Asp His Gly Ser Leu Tyr
100 105 110

Ile Ala Gln Trp Ile Ile Gly Ala Asp Gly Glu Thr Val Ala Thr Arg
115 120 125

Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Leu Phe Gly Glu Gly
130 135 140

Asp Gly Ser Asp Leu Ser Val Phe Asp Thr Pro Leu Gly Arg Val Gly
145 150 155 160

Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr Ala Leu
165 170 175

Tyr Ala Gln Asn Glu Gln Val His Ile Ala Ser Trp Pro Ser Phe Ser
180 185 190

Leu Tyr Arg Gly Gly Ala Tyr Ala Leu Gly Ala Glu Val Asn Asn Ala
195 200 205

Ala Ser Gln Ile Tyr Ala Val Glu Gly Gln Cys Phe Val Ile Ala Pro
210 215 220

Cys Gly Val Val Thr Lys Glu Met Leu Asp Val Leu Cys Thr Asp Glu
225 230 235 240

Met Lys Lys Gln Leu Leu Val Glu Gly Gly Gly Phe Ala Arg Ile Tyr
245 250 255

Ala Pro Asp Gly Gln Met Met His Ala Pro Leu Ala Glu Asn Glu Glu
260 265 270

Gly Leu Val Tyr Ala Asp Leu Asp Leu Gly Met Ile Ser Leu Ala Lys
275 280 285

Val Val Ala Asp Pro Ala Gly His Tyr Ala Arg Pro Asp Val Thr Arg
290 295 300

Leu Leu Leu Asp Lys Thr Pro Gly Asp Arg Val Met Leu Ala Ser Arg
305 310 315 320

Arg Gly Lys Glu Val Ser Arg Ala Gly Asn Asp Glu Pro Gln Val Leu
325 330 335

Val Ser Arg Asn Asp Thr Leu Thr Ser Pro Lys Pro Ala Ser Ser Arg
340 345 350

Lys Ala Ser
355

<210> 307 
<211> 942 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 307

atggctgaac cggaatcctt tatcgtcgct gctgtacagg ctacaccgat ctttcttgac 60
cgccaggcaa cccttgagaa agcgtgcgac ctgattgccg aagctggcag caatggcgca 120
aaactcgttc tttttcccga agcctttatc cccacctatc ctgattggat atgggcggtg 180
acaggctcac aatctgcgct gctcgacgaa ctttatgtgg aactactgga aaactccgtg 240
accatccccg acgcgaccac tgagcaactt tgtgaagcag cacgtaacgc cggtctctac 300
gtcgtcatgg gagtgaatga gcgcaacgcc gaggcgagca acgccacact ctataacacc 360
ctgctctata ttgacgatca gggcaaaatt ctcggcaagc atcgcaaatt ggtcccgacc 420
gccctggagc gaatcgtctg gggctatggc gatggcagca cgcttgacgc ctttgaaacg 480
ccgctgggca agattggcgg gctgatctgt tgggaaaatt acatgccact ggcgcgccaa 540
acactttatg cctggggggt gcaaatttac ttggccgcaa cgtgggatcg cggcgaagtt 600
tggcaggcga ccatgcgcca tattgccagg gaaggcggcg tctatgtagt cgcctcctgt 660
attccatttc acatcaaaga cattcctgac cacatgcctg aaatccgcaa tctctatgca 720
ccggggacag actggatcaa cgtcggccaa agctgcatca tcaaccccag cggcgactat 780
attgcaggcc ctgtcgagtg tcgcgaggag attctttatg ccgaggtaaa tctgcgccag 840
agtgcggcgg caaaacgtat gttggatgtg gcgggccatt atggacgccc tgatgtcttt 900
cacctcaccg tcaaccgcac gcccaatccg catattcgat aa 942

<210> 308 
<211> 313 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 308

Met Ala Glu Pro Glu Ser Phe Ile Val Ala Ala Val Gln Ala Thr Pro
1 5 10 15

Ile Phe Leu Asp Arg Gln Ala Thr Leu Glu Lys Ala Cys Asp Leu Ile
20 25 30

Ala Glu Ala Gly Ser Asn Gly Ala Lys Leu Val Leu Phe Pro Glu Ala
35 40 45

Phe Ile Pro Thr Tyr Pro Asp Trp Ile Trp Ala Val Thr Gly Ser Gln
50 55 60

Ser Ala Leu Leu Asp Glu Leu Tyr Val Glu Leu Leu Glu Asn Ser Val
65 70 75 80

Thr Ile Pro Asp Ala Thr Thr Glu Gln Leu Cys Glu Ala Ala Arg Asn
85 90 95

Ala Gly Leu Tyr Val Val Met Gly Val Asn Glu Arg Asn Ala Glu Ala
100 105 110

Ser Asn Ala Thr Leu Tyr Asn Thr Leu Leu Tyr Ile Asp Asp Gln Gly
115 120 125

Lys Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Ala Leu Glu Arg
130 135 140

Ile Val Trp Gly Tyr Gly Asp Gly Ser Thr Leu Asp Ala Phe Glu Thr
145 150 155 160

Pro Leu Gly Lys Ile Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met Pro
165 170 175

Leu Ala Arg Gln Thr Leu Tyr Ala Trp Gly Val Gln Ile Tyr Leu Ala
180 185 190

Ala Thr Trp Asp Arg Gly Glu Val Trp Gln Ala Thr Met Arg His Ile
195 200 205

Ala Arg Glu Gly Gly Val Tyr Val Val Ala Ser Cys Ile Pro Phe His
210 215 220

Ile Lys Asp Ile Pro Asp His Met Pro Glu Ile Arg Asn Leu Tyr Ala
225 230 235 240

Pro Gly Thr Asp Trp Ile Asn Val Gly Gln Ser Cys Ile Ile Asn Pro
245 250 255

Ser Gly Asp Tyr Ile Ala Gly Pro Val Glu Cys Arg Glu Glu Ile Leu
260 265 270

Tyr Ala Glu Val Asn Leu Arg Gln Ser Ala Ala Ala Lys Arg Met Leu
275 280 285

Asp Val Ala Gly His Tyr Gly Arg Pro Asp Val Phe His Leu Thr Val
290 295 300

Asn Arg Thr Pro Asn Pro His Ile Arg
305 310



<210> 309 
<211> 951 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 309

ttgaccgact 

gcgaccacga 

gtcgcattcg 

gcgcggttca 

tcgatagcag 

gtcatcctcg 

tgcgtgttca 

tacgaagaac 

gggccgttca 

gcgctgtatg 

acggaagaca 

ggcatcattc 

aaaggcgaaa 

atcgcgcccg 

cgccgcgagc 

ataaccgttg 



gtctcaaaat 
agaaggtcgt 
gcgaagcgct 
attccgacgt 
gcggtcacct 
gcatcgccga 
tcgatgccga 
gcctcagttg 
cggtgggcgc 
ccggcggaga 
tcacgcgctt 
gcgaaagcga 
ccatctacaa 
taaccgaccg 
ggcagaattt 
atcgtcgacg 



agccgccgcc 
cgaaaccatc 
gttgcccgcc 
gcaaaaaaac 
atctccgatt 
gcgcgcgacc 
cggccgaatc 
gggattcggc 
gttgaactgc 
agatttgcac 
catcgcacgc 
catccccagc 
cggcggaagc 
tgaagagttg 
cgacccggcc 
acaaacagcg 



caaatcactc 
gaaaaagcgg 
tatccattgt 
ttgcacgcga 
tgcaaaatcg 
gaccggggcg 
gcgtcggtcc 
gacggtgcgg 
tgggaaaact 
gttgcgatct 
gagtcgcgct 
ggggtcccct 
tgcatcgccg 
atcttcgcgg 
gggcattacg 
gcgaatttta 



cggtcttcct 
ccgccggcgg 
ggctgacgcg 
tctatctcaa 
caagcgaacg 
accacaccat 
atcgcaagct 
gactcgtcac 
ggatgcccct 
ggcccggcgg 
cgttcgtcct 
atcgcgatga 
gacccgacgg 
agatcgacca 
cgcggcccga 
ttgatgacta 



cgaccgcgtt 
tgcgcggctg 
caccgacgcc 
gcaatccgtc 
caagattgcc 
ttactgctcg 
gatgccgaca 
gcatccggtc 
cgcgcgcacc 
atcggtgctc 
gtccgtcagc 
aatgtgtgcg 
tcagtggatc 
cgaacacgtc 
tgtgttgcaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
951 



<210> 310 
<211> 316 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 310 

Leu Thr Asp Cys Leu Lys 

1 5 
Leu Asp Arg Val Ala Thr 
20 

Ala Ala Ala Gly Gly Ala 
35 

Pro Ala Tyr Pro Leu Trp 
50 

Ser Asp Val Gin Lys Asn 
65 70 
Ser lie Ala Gly Gly His 
85 

Arg Lys lie Ala Val lie 
100 

Gly Asp His Thr He Tyr 
115 

Arg He Ala Ser Val His 
130 

Leu Ser Trp Gly Phe Gly 
145 150 
Gly Pro Phe Thr Val Gly 
165 

Leu Ala Arg Thr Ala Leu 
180 

He Trp Pro Gly Gly Ser 
195 

Ala Arg Glu Ser Arg Ser 
210 

Glu Ser Asp He Pro Ser 
225 230 
Lys Gly Glu Thr He Tyr 
245 

Gly Gin Trp He He Ala 



He 

Thr 

Arg 

Leu 
55 
Leu 

Leu 

Leu 

Cys 

Arg 
135 
Asp 

Ala 

Tyr 

Val 

Phe 
215 
Gly 

Asn 

Pro 



Ala 

Lys 

Leu 
40 
Thr 

His 

Ser 

Gly 

Ser 
120 
Lys 

Gly 

Leu 

Ala 

Leu 
200 
Val 

Val 

Gly 

Val 



Ala Ala 

10 
Lys Val 
25 

Val Ala 

Arg Thr 

Ala He 

Pro He 
90 

He Ala 
105 

Cys Val 

Leu Met 

Ala Gly 

Asn Cys 
170 
Gly Gly 
185 

Thr Glu 

Leu Ser 

Pro Tyr 

Gly Ser 
250 
Thr Asp 



Gin He 

Val Glu 

Phe Gly 

Asp Ala 
60 

Tyr Leu 
75 

Cys Lys 

Glu Arg 

Phe He 

Pro Thr 
140 
Leu Val 
155 

Trp Glu 

Glu Asp 

Asp He 

Val Ser 
220 
Arg Asp 
235 

Cys He 



Thr 

Thr 

Glu 

45 

Ala 

Lys 

He 

Ala 

Asp 
125 
Tyr 

Thr 

Asn 

Leu 

Thr 
205 
Gly 



Glu 
Ala 

Arg Glu Glu 



Pro 

He 
30 
Ala 

Arg 

Gin 

Ala 

Thr 
110 
Ala 

Glu 

His 

Trp 

His 
190 
Arg 

He 

Met 

Gly 

Leu 



Val Phe 
15 

Glu Lys 

Leu Leu 

Phe Asn 

Ser Val 

80 
Ser Glu 
95 

Asp Arg 

Asp Gly 

Glu Arg 

Pro Val 
160 
Met Pro 
175 

Val Ala 

Phe He 

He Arg 

Cys Ala 
240 
Pro Asp 
255 

He Phe 
260 265 270

Ala Glu Ile Asp His Glu His Val Arg Arg Glu Arg Gln Asn Phe Asp

275 280 285

Pro Ala Gly His Tyr Ala Arg Pro Asp Val Leu Gln Ile Thr Val Asp

290 295 300

Arg Arg Arg Gln Thr Ala Ala Asn Phe Ile Asp Asp
305 310 315 



<210> 311 
<211> 1011 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 311 

atgtcagaaa 

gcagatggaa 

caaattatag 

cccccagttt 

ggtcctgtca 

ggtgtaaacg 

ggtgagttga 

ggacaaggtg 

ctagcttgct 

gaaattcact 

gatgtcacta 

tggctcacag 

cgtggcggct 

cgtgaaggtg 

agaatgatgg 

gacaagccag 

cgaatcgcag 



agcgaataat 
cagttaagaa 
tattccccga 
ctgctggcgc 
ctcaagcgat 
agagagataa 
tgttgaagag 
atgcttcagg 
gggaacatta 
gcgcacaatt 
tcagacatca 
acgagcaaat 
gcaacacagc 
aaggcatttt 
attcagtagg 
caacaacaaa 
aggagttgaa 



cagagcagct 
ggtatgcaag 
aaccttcatt 
tgagcatttg 
cgccgagaga 
cggtagcctc 
aagaaaaatc 
cttaaaagta 
caacccgctg 
cccaggttct 
cgcattggaa 
ccagtccatt 
aatcatttct 
gattgctgac 
tcattattca 
attttcaatg 
aaatgacgac 



gcagttcaga 
gtaatcgatg 
ccgtattacc 
aagctttatg 
gctagagtga 
tataacactc 
actcctacat 
gttgaaacga 
gccagatatt 
atggttggcc 
tctggctgtt 
acagatgacc 
cccgaaggtg 
ctggacatgt 
agacctgaac 
actgaggggt 
aagcttagca 



tcacacctga 
aagcaggcgc 
catacttctc 
aaaaaagtgt 
atcaaatggt 
aactgatctt 
atcatgagcg 
gcattgcccg 
ctctcatgac 
aaatatttgc 
tcgtcattaa 
caaaaatgca 
tgcacttaac 
cactcatcac 
tattaagtct 
gtactcaaac 
ccggaaacta 



atttgactca 
aaagggtgta 
attcattacc 
cgtgatacct 
cgtcgtactc 
cgataccaat 
catgatctgg 
ggtaggtgct 
acagcatgaa 
cgaccaaatg 
tgccaccggc 
gaaagcatta 
agagccctta 
aaaacgaaaa 
ggcgatcaat 
tgagcaattt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1011 



<210> 312 
<211> 336 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 312 



Met 


Ser 


Glu 


Lys 


Arg 


He 


He 


Arg 


1 








5 








Glu 


Phe 


Asp 


Ser 


Ala 


Asp 


Gly 


Thr 








20 










Asp 


Glu 


Ala 


Gly 


Ala 


Lys 


Gly 


Val 






35 










40 


Phe 


He 


Pro 


Tyr 


Tyr 


Pro 


Tyr 


Phe 




50 










55 




Ala 


Gly 


Ala 


Glu 


His 


Leu 


Lys 


Leu 


65 










70 






Gly 


Pro 


Val 


Thr 


Gin 
85 


Ala 


He 


Ala 


Val 


Val 


Val 


Leu 


Gly 


Val 


Asn 


Glu 








100 










Thr 


Gin 


Leu 


He 


Phe 


Asp 


Thr 


Asn 






115 










120 


Lys 


He 


Thr 


Pro 


Thr 


Tyr 


His 


Glu 



Ala 


Ala 
10 


Ala 


Val 


Gin 


He 


Thr 
15 


Pro 


Val 


Lys 


Lys 


Val 


Cys 


Lys 


Val 


He 


25 










30 






Gin 


He 


He 


Val 


Phe 
45 


Pro 


Glu 


Thr 


Ser 


Phe 


He 


Thr 
60 


Pro 


Pro 


Val 


Ser 


Tyr 


Glu 


Lys 


Ser 


Val 


Val 


He 


Pro 




75 










80 


Glu 


Arg 
90 


Ala 


Arg 


Val 


Asn 


Gin 
95 


Met 


Arg 


Asp 


Asn 


Gly 


Ser 


Leu 


Tyr 


Asn 


105 










110 






Gly 


Glu 


Leu 


Met 


Leu 


Lys 


Arg 


Arg 








125 








Arg 


Met 


He 


Trp 


Gly 


Gin 


Gly 


Asp 
130










135 










140 










Ala 


Ser 


Gly 


Leu 


Lys 


Val 


Val 


Glu 


Thr 


Ser 


He 


Ala 


Arg 


Val 


Gly 


Ala 


145 






150 










155 










160 


Leu 


Ala 


Cys 


Trp 


Glu 


His 


Tyr 


Asn 


Pro 


Leu 


Ala 


Arg 


Tyr 


Ser 


Leu 


Met 






165 










170 










175 




Thr 


Gin 


His 


Glu 
180 


Glu 


lie 


His 


Cys 


Ala 
185 


Gin 


Phe 


Pro 


Gly 


Ser 
190 


Met 


Val 


Glv 


Gin 


lie 


Phe 


Ala 


Asp 


Gin 


Met 


Asp 


Val 


Thr 


He 


Arg 


His 


His 


Ala 




195 










200 










205 








Leu 


Glu 


Ser 


Gly Cys 


Phe 


Val 


He 


Asn 


Ala 


Thr 


Gly 


Trp 


Leu 


Thr 


Asp 




210 










215 










220 










Glu 


Gin 


lie 


Gin 


Ser 


He 


Thr 


Asp 


Asp 


Pro 


Lys 


Met 


Gin 


Lys 


Ala 


Leu 


225 










230 










235 








His 


240 


Arg 


Glv 

J: 


Gly 


Cys 


Asn 


Thr 


Ala 


He 


He 


Ser 


Pro 


Glu 


Gly 


Val 


Leu 




245 










250 










255 




Thr 


Glu 


Pro 


Leu 


Arg 


Glu 


Gly 


Glu 


Gly 


He 


Leu 


He 


Ala 


Asp 


Leu 


Asp 








260 








265 










270 






Met 


Ser 


Leu 


lie 


Thr 


Lys 


Arg 


Lys 


Arg 


Met 


Met 


Asp 


Ser 


Val 


Gly 


His 






275 








280 










285 








Tyr 


Ser 


Arg 


Pro 


Glu 


Leu 


Leu 


Ser 


Leu 


Ala 


He 


Asn 


Asp 


Lys 


Pro 


Ala 


290 








2 95 










300 










Thr 


Thr 


Lys 


Phe 


Ser 


Met 


Thr 


Glu 


Gly 


Cys 


Thr 


Gin 


Thr 


Glu 


Gin 


Phe 


305 








310 










315 










320 


Arg 


lie 


Ala 


Glu 


Glu 


Leu 


Lys 


Asn 


Asp 


Asp 


Lys 


Leu 


Ser 


Thr 


Gly 


Asn 








32 5 










330 










335 





<210> 313 
<211> 987 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 313 

atggcggccg ttcaggctgc gccggttccg tttgatgctg aggcttcggt ggataaggcc 60

tgtcgcttaa ttcaagaagc tgcagccaaa ggcgcagata tagttgcttt cggtgaggca 120

tggctacccg gctaccccta ttttgcctgg ttaccccaag taacaccaga gtggtatagt 180

gcggctgccg attatcttgc cagctctgtt gatatccctg gcccgatcac cgataaactt 240

tgccaggctg cccgtcgtgc atcggttgaa ctcatcatgg gtgtggtaga acgcagcaag 300

tctcagggaa ccacctattg cacgcttctt tttattagca aggatggcga aataattggc 360

aagcaccgca agctgaagcc cacactcgcc gaacgaaccg tctggggtga aggcgatgcc 420

agcggactga gggttcacga tcggcctatt gccagaatca gtgggctctc ctgctgggag 480

aacaaaatga tgctgccagg ttacgcactg atggcgcagg gtacgcaggt gcatgtctcc 540

gcctggccag ggatcccaga ggattcaccc atggaggtgc ctgcacaccc ccgtcaaaag 600

ctgctttccc aagcctttgc actgcaaggc gggtgctatg ttatttctcc ctccattgtc 660

cttagggcag aagatgtgcc cgagaaacac gccgctctac tgatgggaga ccaagtgggc 720

ggtagctata tcattgaccc ctgcggaaaa gtgatcgccg aggccggtgc gggtgagact 780

atcctgattg ccaaaggcaa ccttgacctc gtaagggccg ccaaaatggc cagtgatgta 840

ggggggtctt attcacgccc ggatcttttg cagttgatga tcaataaccg accactcgaa 900

cagctgattg agttcagtgc tgaaggtgca ggtaggggga atctagtatc caactcaccc 960

gaggtgtcag aacaagaagg tgagtaa 987



<210> 314
<211> 328
<212> PRT
<213> Unknown



<220> 

<223> Obtained from an environmental sample 
<400> 314
Met Ala Ala Val Gln Ala Ala Pro Val Pro Phe Asp Ala Glu Ala Ser
1 5 10 15

Val Asp Lys Ala Cys Arg Leu Ile Gln Glu Ala Ala Ala Lys Gly Ala
20 25 30

Asp Ile Val Ala Phe Gly Glu Ala Trp Leu Pro Gly Tyr Pro Tyr Phe
35 40 45

Ala Trp Leu Pro Gln Val Thr Pro Glu Trp Tyr Ser Ala Ala Ala Asp
50 55 60

Tyr Leu Ala Ser Ser Val Asp Ile Pro Gly Pro Ile Thr Asp Lys Leu
65 70 75 80

Cys Gln Ala Ala Arg Arg Ala Ser Val Glu Leu Ile Met Gly Val Val
85 90 95

Glu Arg Ser Lys Ser Gln Gly Thr Thr Tyr Cys Thr Leu Leu Phe Ile
100 105 110

Ser Lys Asp Gly Glu Ile Ile Gly Lys His Arg Lys Leu Lys Pro Thr
115 120 125

Leu Ala Glu Arg Thr Val Trp Gly Glu Gly Asp Ala Ser Gly Leu Arg
130 135 140

Val His Asp Arg Pro Ile Ala Arg Ile Ser Gly Leu Ser Cys Trp Glu
145 150 155 160

Asn Lys Met Met Leu Pro Gly Tyr Ala Leu Met Ala Gln Gly Thr Gln
165 170 175

Val His Val Ser Ala Trp Pro Gly Ile Pro Glu Asp Ser Pro Met Glu
180 185 190

Val Pro Ala His Pro Arg Gln Lys Leu Leu Ser Gln Ala Phe Ala Leu
195 200 205

Gln Gly Gly Cys Tyr Val Ile Ser Pro Ser Ile Val Leu Arg Ala Glu
210 215 220

Asp Val Pro Glu Lys His Ala Ala Leu Leu Met Gly Asp Gln Val Gly
225 230 235 240

Gly Ser Tyr Ile Ile Asp Pro Cys Gly Lys Val Ile Ala Glu Ala Gly
245 250 255

Ala Gly Glu Thr Ile Leu Ile Ala Lys Gly Asn Leu Asp Leu Val Arg
260 265 270

Ala Ala Lys Met Ala Ser Asp Val Gly Gly Ser Tyr Ser Arg Pro Asp
275 280 285

Leu Leu Gln Leu Met Ile Asn Asn Arg Pro Leu Glu Gln Leu Ile Glu
290 295 300

Phe Ser Ala Glu Gly Ala Gly Arg Gly Asn Leu Val Ser Asn Ser Pro
305 310 315 320

Glu Val Ser Glu Gln Glu Gly Glu
325



<210> 315 
<211> 960 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 315 

atgacaatgc ttaagaccaa attccgggtt gctgcggtgc aggcagcgcc tgtattcctg 60
gatcgagagg ctacgctgga gaaagcctgc ggactgattg aggaggcggg tcgcaacggg 120
gccagcctgg tggtctttcc tgagtcgttc attccggcct atcccgattg ggtctgggct 180
gtgccggcgg gtgaagaagc tttacttaat gaactgtatg ctcaactgct ggccaacgcc 240
gttgaaattc ccagcccggc caccgaacgc ttgagccagg cagcgaaaaa ggctaaagtc 300
catgtggtta tgggcctgac cgaacgcaac agcgaggcca gcggcggcag cctctacaat 360
accttgctct atcttgaccc acagggccaa attctgggca aacatcgcaa gctggtgccc 420
accggcggcg agcggctggt ttgggcccag ggcgacggca gtaccctgca agtctacgag 480
acccccttgg gtaaactcag cggcttgatt tgctgggaaa attatatgcc gctggcccgc 540
tacgcgctct atgcctgggg tacgcaaatc tatatcgccg ccacctggga tcgaggcgag 600
ccgtggcttt cgacgctgcg gcatatcgcc aaagagggcc gggtgtttgt catcggctgt 660
ggcatggcct tgcgtaaggc tgatattccc gaccattttg aattcaagca gcgcttttat 720
caaaatgccg gcgagtggat caatggaggc gacagcgcca ttgtcaatcc tgagggtgaa 780
tttattgctg gacccctgag cgagcaggaa ggtattttgt acgccgagat tgatcctggc 840
cagatggccg gaccaaaatg gatgctcgat gtggccgggc actatgctcg cccggatgtc 900
tttgaactga ccgtccagac cgtagctcgg cccatgatta cctctactca gccgcgatag 960

<210> 316
<211> 319
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample
<400> 316

Met Thr Met Leu Lys Thr Lys Phe Arg Val Ala Ala Val Gln Ala Ala
1 5 10 15

Pro Val Phe Leu Asp Arg Glu Ala Thr Leu Glu Lys Ala Cys Gly Leu
20 25 30

Ile Glu Glu Ala Gly Arg Asn Gly Ala Ser Leu Val Val Phe Pro Glu
35 40 45

Ser Phe Ile Pro Ala Tyr Pro Asp Trp Val Trp Ala Val Pro Ala Gly
50 55 60

Glu Glu Ala Leu Leu Asn Glu Leu Tyr Ala Gln Leu Leu Ala Asn Ala
65 70 75 80

Val Glu Ile Pro Ser Pro Ala Thr Glu Arg Leu Ser Gln Ala Ala Lys
85 90 95

Lys Ala Lys Val His Val Val Met Gly Leu Thr Glu Arg Asn Ser Glu
100 105 110

Ala Ser Gly Gly Ser Leu Tyr Asn Thr Leu Leu Tyr Leu Asp Pro Gln
115 120 125

Gly Gln Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu
130 135 140

Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln Val Tyr Glu
145 150 155 160

Thr Pro Leu Gly Lys Leu Ser Gly Leu Ile Cys Trp Glu Asn Tyr Met
165 170 175

Pro Leu Ala Arg Tyr Ala Leu Tyr Ala Trp Gly Thr Gln Ile Tyr Ile
180 185 190

Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His
195 200 205

Ile Ala Lys Glu Gly Arg Val Phe Val Ile Gly Cys Gly Met Ala Leu
210 215 220

Arg Lys Ala Asp Ile Pro Asp His Phe Glu Phe Lys Gln Arg Phe Tyr
225 230 235 240

Gln Asn Ala Gly Glu Trp Ile Asn Gly Gly Asp Ser Ala Ile Val Asn
245 250 255

Pro Glu Gly Glu Phe Ile Ala Gly Pro Leu Ser Glu Gln Glu Gly Ile
260 265 270

Leu Tyr Ala Glu Ile Asp Pro Gly Gln Met Ala Gly Pro Lys Trp Met
275 280 285

Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu Leu Thr
290 295 300

Val Gln Thr Val Ala Arg Pro Met Ile Thr Ser Thr Gln Pro Arg
305 310 315



<210> 317 
<211> 993 
<212> DNA
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 317 

atgaccattg taaaagccgc tgccgtccaa ataagccctg tcctttacag ccgcgaaggc 60
accgtggaca aggttgtcca gaagatcctc gaactcggca agcaaggcgt ccagttcgcc 120
actttcccgg agacggtggt cccctactac ccctacttct ccttcgtcca gtcgggctac 180
gccctcaagg tgggcaagga acatctgcgc ttgctcgaac agtcggtcac cgtgccatcg 240
gccaccacgc tcgccatcgg cgaagcctgc aagcaggcgg ggatggtggt gtccatcggc 300
gtcaacgaac gcgacggcag cacgatctac aacacgcagc tgctcttcga tgccgacggc 360
accttgattc agcgccgccg aaagatcagc ccgaccttcc atgaacgcat ggtctggggc 420
cagggcgacg gctccgggct gcgcgcggtc gacagcgcgg tcgggcgcat cggccagttg 480
gcgtgctggg agcactacaa cccgctggcc cgctacgcca tgatggccga cggcgagcag 540
atccactcgg cgatgtaccc cggttccttc gcaggcgacg ccttctccga acagatccag 600
gtcaacatcc gccagcacgc attggaagcc ggctgcttcg tcgtgaacgc caccgcgtgg 660
ctggacgccg atcagcaggc gcagatcatg caggacaccg gttgcgccat cggcccgatc 720
tccagtggtt gcttcaccgc catcgtttcg ccggacggcg tgttgctggg cgagcctctg 780
cggtccggtg agggcgaggt gattgccgat ctcgacttca cgctgatcga caagcgcaag 840
cagatgatgg attcacgcgg gcactatgcg cgcccggaat tgctcagcct gttgatcgac 900
cgcacggcga ccgcgcatgt gcatgagcgc agcgcgcatc cgaaggcgac tgccgagcag 960
gcggacggtc catccgccgt gaatgcgcag taa 993



<210> 318 
<211> 330 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 318 



Met Thr Ile Val Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15

Ser Arg Glu Gly Thr Val Asp Lys Val Val Gln Lys Ile Leu Glu Leu
20 25 30

Gly Lys Gln Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45

Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Ser Gly Tyr Ala Leu Lys Val
50 55 60

Gly Lys Glu His Leu Arg Leu Leu Glu Gln Ser Val Thr Val Pro Ser
65 70 75 80

Ala Thr Thr Leu Ala Ile Gly Glu Ala Cys Lys Gln Ala Gly Met Val
85 90 95

Val Ser Ile Gly Val Asn Glu Arg Asp Gly Ser Thr Ile Tyr Asn Thr
100 105 110

Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125

Ile Ser Pro Thr Phe His Glu Arg Met Val Trp Gly Gln Gly Asp Gly
130 135 140

Ser Gly Leu Arg Ala Val Asp Ser Ala Val Gly Arg Ile Gly Gln Leu
145 150 155 160

Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Met Met Ala
165 170 175

Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Phe Ala Gly
180 185 190

Asp Ala Phe Ser Glu Gln Ile Gln Val Asn Ile Arg Gln His Ala Leu
195 200 205

Glu Ala Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220

Gln Gln Ala Gln Ile Met Gln Asp Thr Gly Cys Ala Ile Gly Pro Ile
225 230 235 240

Ser Ser Gly Cys Phe Thr Ala Ile Val Ser Pro Asp Gly Val Leu Leu
245 250 255

Gly Glu Pro Leu Arg Ser Gly Glu Gly Glu Val Ile Ala Asp Leu Asp
260 265 270

Phe Thr Leu Ile Asp Lys Arg Lys Gln Met Met Asp Ser Arg Gly His
275 280 285

Tyr Ala Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Ala Thr
290 295 300

Ala His Val His Glu Arg Ser Ala His Pro Lys Ala Thr Ala Glu Gln
305 310 315 320

Ala Asp Gly Pro Ser Ala Val Asn Ala Gln
325 330















<210> 319 
<211> 1017 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 319 

atgacgaccc cgcgtatcgt ccgcgtcgcc gcggtgcaga tggcgcccga cctggaatcc 60
gcacacggca cggtagacaa ggtctgccgc gccattctgg aggccggcga aaaaggcgcg 120
cgcatggtgg tctttcccga gaccttcgtg ccctactacc cctacttctc cttcatccag 180
ccggcggtga cgatgggcgc cgcccacctc gagctctacg agcgcgcggt gacggtgccc 240
ggcccggtca ccctggccgt gggcgaagcg gcgcggcggg cgggcgcggt cgtcgtgctc 300
ggcgtcaatg aacgcgacca cggctcgctc tacaacaccc agctgatctt cgacgagacc 360
ggcgcgctgg tgctcaagcg ccgcaagctc accccgacct atcacgagcg catggtgtgg 420
ggccagggcg atggcagcgg tctcaaggtg gtggacaccg gcatcggccg catcggcgcg 480
ctggcctgct gggagcacta caacccgctc gcccgctaca ccctgatggc gcagcatgaa 540
gagatccacg ccgcgcagtt ccccggttcc atggtcggcc agatctttgc cgaccagatg 600
gcggtgacca tccgccacca cgcgctggaa tccggctgct tcgtggtgaa cgccaccggc 660
tggctcaccg atgcgcagat caccgccatc acccccgacc cggccatgca gcgcgcgctg 720
cgcggcggct gccacaccgc catcgtgtcg ccggaaggca gctacgtctg cgaaccgctc 780
accgagggcg aaggcatgct ggtggccgac ctcgacatgc ggctggtcac caagcgcaag 840
cggatgatgg attcggtcgg ccactacgcc cggccggagc tgctttcgct caacgccgat 900
cttgccccca agccggcgct gcacacacag ccggctgcgt ccctccctct ctcccttcag 960
gcaggagccg accatgtcga cgacgaccgc accgcgtccg caacagctga tctctga 1017



<210> 320 
<211> 338 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
<400> 320 

Met Thr Thr Pro Arg Ile Val Arg Val Ala Ala Val Gln Met Ala Pro
1 5 10 15

Asp Leu Glu Ser Ala His Gly Thr Val Asp Lys Val Cys Arg Ala Ile
20 25 30

Leu Glu Ala Gly Glu Lys Gly Ala Arg Met Val Val Phe Pro Glu Thr
35 40 45

Phe Val Pro Tyr Tyr Pro Tyr Phe Ser Phe Ile Gln Pro Ala Val Thr
50 55 60

Met Gly Ala Ala His Leu Glu Leu Tyr Glu Arg Ala Val Thr Val Pro
65 70 75 80

Gly Pro Val Thr Leu Ala Val Gly Glu Ala Ala Arg Arg Ala Gly Ala
85 90 95

Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Ser Leu Tyr Asn
100 105 110

Thr Gln Leu Ile Phe Asp Glu Thr Gly Ala Leu Val Leu Lys Arg Arg
115 120 125

Lys Leu Thr Pro Thr Tyr His Glu Arg Met Val Trp Gly Gln Gly Asp
130 135 140

Gly Ser Gly Leu Lys Val Val Asp Thr Gly Ile Gly Arg Ile Gly Ala
145 150 155 160

Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Thr Leu Met
165 170 175

Ala Gln His Glu Glu Ile His Ala Ala Gln Phe Pro Gly Ser Met Val
180 185 190

Gly Gln Ile Phe Ala Asp Gln Met Ala Val Thr Ile Arg His His Ala
195 200 205

Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Gly Trp Leu Thr Asp
210 215 220

Ala Gln Ile Thr Ala Ile Thr Pro Asp Pro Ala Met Gln Arg Ala Leu
225 230 235 240

Arg Gly Gly Cys His Thr Ala Ile Val Ser Pro Glu Gly Ser Tyr Val
245 250 255

Cys Glu Pro Leu Thr Glu Gly Glu Gly Met Leu Val Ala Asp Leu Asp
260 265 270

Met Arg Leu Val Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285

Tyr Ala Arg Pro Glu Leu Leu Ser Leu Asn Ala Asp Leu Ala Pro Lys
290 295 300

Pro Ala Leu His Thr Gln Pro Ala Ala Ser Leu Pro Leu Ser Leu Gln
305 310 315 320

Ala Gly Ala Asp His Val Asp Asp Asp Arg Thr Ala Ser Ala Thr Ala
325 330 335

Asp Leu



<210> 321 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 321 

atgagagttg tcaaagccgc tgctgtccaa ctgagtcctg tcctccatag ccgcgacgga 60
acggtcgaaa aggtcgtgcg gaagatccat gaactcgccg aagagggagt cgagttcgcc 120
acctttcctg agaccgtggt gccttactac ccgtactttt ccttcgttca gacgcccttg 180
cagcaaatct tcggaactga gtatctgagg ctgctcgacc aggcagtcac cgtgccatcc 240
gctgccaccg acgcgatcgg cgaggctgcc aggtgggctg gacttgttgt ctcgatcggc 300
gtcaacgagc gagacggggg aactctctac aacactcagc ttctcttcga tgccgacggc 360
agcttaattc agcggcgtcg caagatcaca cccacccatt acgagcgcat gatctggggc 420
cagggcgacg gctcaggtct gcgggccgtt gatagcaagg ccggccgcat tggtcagctg 480
gcatgctggg aacacaacaa ccccctggcg cgctacgcgc tgatggccga cggcgagcag 540
atccattcgg ccatgtatcc gggctccatg ttcggcgact cgttttccca aaagaccgaa 600
atcaatatcc ggcagcatgc gctggaatct gcgtgcttcg tcgtgaacgc aacggcctgg 660
ctggacgccg atcagcaggc gcaaatcatg aaggacaccg gctgcggcat cggcccgatc 720
tccggcggtt gcttcactgc gatcgttgca cccgatggta gcctgctggg cgaacccatc 780
cgttccggtg agggcgtcgt cgtcgccaac ctcgacttca cgctgatcga caggcgtaag 840
caggtgatgg actcgcgagg ccactacagc cggccggagt tgctcagcct cttaatagac 900
cgcactccta ccgcgcacgt tcacgaacgc gctacgcacc ccacgacagg agctgagcaa 960
ggctccgagg atgtgttcga ggctcgcatt taa 993

<210> 322 
<211> 330 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 322 

Met Arg Val Val Lys Ala Ala Ala Val Gln Leu Ser Pro Val Leu His

1 5 10 15

Ser Arg Asp Gly Thr Val Glu Lys Val Val Arg Lys Ile His Glu Leu

20 25 30

Ala Glu Glu Gly Val Glu Phe Ala Thr Phe Pro Glu Thr Val Val Pro

35 40 45

Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Thr Pro Leu Gln Gln Ile Phe

50 55 60

Gly Thr Glu Tyr Leu Arg Leu Leu Asp Gln Ala Val Thr Val Pro Ser
65 70 75 80

Ala Ala Thr Asp Ala Ile Gly Glu Ala Ala Arg Trp Ala Gly Leu Val

85 90 95

Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr

100 105 110

Gln Leu Leu Phe Asp Ala Asp Gly Ser Leu Ile Gln Arg Arg Arg Lys

115 120 125

Ile Thr Pro Thr His Tyr Glu Arg Met Ile Trp Gly Gln Gly Asp Gly

130 135 140

Ser Gly Leu Arg Ala Val Asp Ser Lys Ala Gly Arg Ile Gly Gln Leu
145 150 155 160

Ala Cys Trp Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala

165 170 175

Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Met Phe Gly

180 185 190

Asp Ser Phe Ser Gln Lys Thr Glu Ile Asn Ile Arg Gln His Ala Leu

195 200 205

Glu Ser Ala Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp

210 215 220

Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Gly Ile Gly Pro Ile
225 230 235 240

Ser Gly Gly Cys Phe Thr Ala Ile Val Ala Pro Asp Gly Ser Leu Leu

245 250 255

Gly Glu Pro Ile Arg Ser Gly Glu Gly Val Val Val Ala Asn Leu Asp

260 265 270

Phe Thr Leu Ile Asp Arg Arg Lys Gln Val Met Asp Ser Arg Gly His

275 280 285

Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Thr

290 295 300

Ala His Val His Glu Arg Ala Thr His Pro Thr Thr Gly Ala Glu Gln
305 310 315 320

Gly Ser Glu Asp Val Phe Glu Ala Arg Ile
325 330

<210> 323 
<211> 951 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 323

atgagcaacg tgaaagtcgc ggtcgtgcaa cgcgcgccgg tgttcggcaa ccgcgccgcc 60
accctcgatc gcgccgtcgc cgcgctggcc gaagcggcgc agcagggggc gaagctggtc 120
gtgatgcccg agcacttcat ccccggctac ccggcctgga tctggcgcct gcgtccgggc 180
accgacctgc gattgtgcga acagctgcac gcgatgctgc gcgccaacgc cgtgaggctg 240
gatgacggtg acctggcccc gttgaccgag gccgcgcagc ggcatgcgct caccgtggtc 300
tgcggcgtct gcgagatcga caccgaattc agtcgcggca ccctgtacaa caccgtggtc 360
gtgatcgggc ccgacggcac gctgctcaac cggcatcgca agctgatgcc caccaacccc 420
gagcgcatgg tctggggcat gggcgacgcc acggggctga aggtggtcga cacgccctgc 480
gggcgcatcg gcacgctgat ttgctgggag aactacatgc cattcgcacg cgccgcgctg 540
tacgcgcagg gggtcgaggt cctggttgca ccgacctacg acgaaggccc ggtatggctg 600
gcgtcgatgc agcacatcgc ccgcgaaggc ggctgctggg tggtgggcaa cggctgcgca 660
ttccagggcc gcgacatgcc ggacaccttg ccgggcaagg cccagctgtt tcccgaggcc 720
gacgcctggg tcaacgccgg ggactcggtc atcgtcgcgc caggcggccg gacagtggcg 780
ggtccgttgc acgaggcgtt cgggctgttc accgccgaga tcgacctctc ccgggtcgga 840
atggcccggc gcagcctgga tgtggccggg cactatggac ggcccgacat cttctgcctg 900
caggtcaacg cccgggcgca gccgccggtt gaggtgacgc accatggctg a 951



<210> 324 
<211> 316 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample
<400> 324

Met Ser Asn Val Lys Val Ala Val Val Gln Arg Ala Pro Val Phe Gly
1 5 10 15

Asn Arg Ala Ala Thr Leu Asp Arg Ala Val Ala Ala Leu Ala Glu Ala
20 25 30

Ala Gln Gln Gly Ala Lys Leu Val Val Met Pro Glu His Phe Ile Pro
35 40 45

Gly Tyr Pro Ala Trp Ile Trp Arg Leu Arg Pro Gly Thr Asp Leu Arg
50 55 60

Leu Cys Glu Gln Leu His Ala Met Leu Arg Ala Asn Ala Val Arg Leu
65 70 75 80

Asp Asp Gly Asp Leu Ala Pro Leu Thr Glu Ala Ala Gln Arg His Ala
85 90 95

Leu Thr Val Val Cys Gly Val Cys Glu Ile Asp Thr Glu Phe Ser Arg
100 105 110

Gly Thr Leu Tyr Asn Thr Val Val Val Ile Gly Pro Asp Gly Thr Leu
115 120 125

Leu Asn Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val
130 135 140

Trp Gly Met Gly Asp Ala Thr Gly Leu Lys Val Val Asp Thr Pro Cys
145 150 155 160

Gly Arg Ile Gly Thr Leu Ile Cys Trp Glu Asn Tyr Met Pro Phe Ala
165 170 175

Arg Ala Ala Leu Tyr Ala Gln Gly Val Glu Val Leu Val Ala Pro Thr
180 185 190

Tyr Asp Glu Gly Pro Val Trp Leu Ala Ser Met Gln His Ile Ala Arg
195 200 205

Glu Gly Gly Cys Trp Val Val Gly Asn Gly Cys Ala Phe Gln Gly Arg
210 215 220

Asp Met Pro Asp Thr Leu Pro Gly Lys Ala Gln Leu Phe Pro Glu Ala
225 230 235 240

Asp Ala Trp Val Asn Ala Gly Asp Ser Val Ile Val Ala Pro Gly Gly
245 250 255

Arg Thr Val Ala Gly Pro Leu His Glu Ala Phe Gly Leu Phe Thr Ala
260 265 270

Glu Ile Asp Leu Ser Arg Val Gly Met Ala Arg Arg Ser Leu Asp Val
275 280 285

Ala Gly His Tyr Gly Arg Pro Asp Ile Phe Cys Leu Gln Val Asn Ala
290 295 300

Arg Ala Gln Pro Pro Val Glu Val Thr His His Gly
305 310 315



<210> 325 
<211> 1077 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 325 

atgactgcaa aaaagattgt ccgcgccgcc gctgtccagc tcaatcccgt gctggacagc 60
gccgacggca cccttgtgaa agtgttgcag gcgattgccg acgccgcagc gcagggcgtg 120
caactgatcg tctttcccga gacggtggtg ccctactatc cttacttttc cttcgtcacg 180
cctgcggttt cgatgggcgc ggcgcatctg aaactgtacg aacaatcgcc cacggtgcca 240
ggtccactga ccgacgccgt cgccgcagcc gcgcgggcac atcagatggt ggttgtgctc 300
ggcgtcaacg agcgcgatca cggcacgctc tacaacacgc aactgatctt cgacgccgac 360
ggcacgctcc cactgaagcg tcgcaagatc acgccgacct atcacgagcg catggtctgg 420
ggcatgggcg atggctccgg cctgcgcacg gtgaagaccg aggtcggaac cgttggcgcg 480
ctggcctgct gggaacacta caacccgctg gcacgctacg cgctgatggc gcagcacgaa 540
gagatccatt gcagccagtt ccccggctcg ctggtcggcc cgatcttttc cgagcagatg 600
gaaatcacca tgcgtcatca cgcgctggaa tccggctgct tcgtcgtcaa cgccactgca 660
tggctgacgc ctgagcaggt gcgatcacag gcgccaacac cggcaatgga aaaagccttc 720
tccggtggtt gctacaccgc gatcatttcg ccggaaggaa aacatctggg cgaacctctt 780
cgcgacggcg aaggcatggt catcgccgat cttgattttg atctcatcac caagcgcaag 840
cgaatgatgg attcggttgg ccactacgca cggccggaat tgttgagcct gcagctcgac 900
aaccgatcaa ctgcaccgct gacaacgtcg ccggtggccg ccgcagcgcc gtcgcttgca 960
gagatggaag cacagcgcct gtcacgttat ctcgatgcca gctccggcag cgccgcacaa 1020
ggcatcgaag ccgcctacat caatgccctc agctcttttt caggaaaacc ctcatga 1077



<210> 326 
<211> 358 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 326 




























Met Thr Ala Lys Lys Ile Val Arg Ala Ala Ala Val Gln Leu Asn Pro
1 5 10 15

Val Leu Asp Ser Ala Asp Gly Thr Leu Val Lys Val Leu Gln Ala Ile
20 25 30

Ala Asp Ala Ala Ala Gln Gly Val Gln Leu Ile Val Phe Pro Glu Thr
35 40 45

Val Val Pro Tyr Tyr Pro Tyr Phe Ser Phe Val Thr Pro Ala Val Ser
50 55 60

Met Gly Ala Ala His Leu Lys Leu Tyr Glu Gln Ser Pro Thr Val Pro
65 70 75 80

Gly Pro Leu Thr Asp Ala Val Ala Ala Ala Ala Arg Ala His Gln Met
85 90 95

Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Thr Leu Tyr Asn
100 105 110

Thr Gln Leu Ile Phe Asp Ala Asp Gly Thr Leu Pro Leu Lys Arg Arg
115 120 125

Lys Ile Thr Pro Thr Tyr His Glu Arg Met Val Trp Gly Met Gly Asp
130 135 140

Gly Ser Gly Leu Arg Thr Val Lys Thr Glu Val Gly Thr Val Gly Ala
145 150 155 160

Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met
165 170 175

Ala Gln His Glu Glu Ile His Cys Ser Gln Phe Pro Gly Ser Leu Val
180 185 190

Gly Pro Ile Phe Ser Glu Gln Met Glu Ile Thr Met Arg His His Ala
195 200 205

Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Thr Pro
210 215 220










Glu 


Gin 


Val 


Arg 


Ser 


Gin 


Ala 


Pro 


Thr 


Pro 


Ala 


Met 


Glu 


Lys 


Ala 


Phe 


225 










230 










235 












Ser 


Gly 


Gly 


Cys 


Tyr 


Thr 


Ala 


He 


lie 


Ser 


Pro 


Glu 


Gly 


Lys 


His 


Leu 










245 










250 










255 




Gly 


Glu 


Pro 


Leu 


Arg 


Asp 


Gly 


Glu 


Gly 


Met 


Val 


lie 


Ala 


Asp 


Leu 


Asp 








2 60 










2 65 










O "7 A 

z /u 






Phe 


Asp 


Leu 


lie 


Thr 


Lys 


Arg 


Lys 


Arg 


Met 


Met 


Asp 


Ser 


val 


Gly 


His 






275 










28 0 










Zoo 








Tyr 


Ala 


Arg 


Pro 


Glu 


Leu 


Leu 


Ser 


Leu 


Gin 


Leu 


Asp 


Asn 


Arg 


Ser 


inr 




o on 










£~ Z2 *J 










300 










Ala 


Pro 


Leu 


Thr 


Thr 


Ser 


Pro 


Val 


Ala 


Ala 


Ala 


Ala 


Pro 


Ser 


Leu 


Ala 


305 










310 










315 










320 


Glu 


Met 


Glu 


Ala 


Gin 


Arg 


Leu 


Ser 


Arg 


Tyr 


Leu 


Asp 


Ala 


Ser 


Ser 


Gly 










325 










330 










335 




Ser 


Ala 


Ala 


Gin 


Gly 


He 


Glu 


Ala 


Ala 


Tyr 


He 


Asn 


Ala 


Leu 


Ser 


Ser 








340 










345 










350 






Phe 


Ser 


Gly 


Lys 


Pro 


Ser 























355 



<210> 327 
<211> 975 
<212> DNA 
<213> Unknown 

<220>

<223> Obtained from an environmental sample 



<400> 327 

atggttgatc aaatacttaa cgatcgttct gaattattaa cagttggttt ggcgcagatt 60
gctccaatct ggctgaaccg cgaaaaaaca cttgcgaaag tggttgagaa agtaaatcaa 120
gctgcaaagc aagattgtca tcttgttgct tttggtgaag ctttggtccc cggatatccc 180
ttttggattg aacttacaga tggcgcgcga ttcaactcca atgttcaaaa agaaattcac 240
gcgcactata tggatcaggc agtacagata gagaatggtc atttaaaagc gctttgtgaa 300
acatcggccg caaacaagat tgccgtgatc gttgggtgca ttgaacgcgc agccgatcgc 360
ggcgggcaca gcttatatgc ttcattggtt tttattaatc ctcaagggca aatcggatcg 420
gtgcatcgca aacttatgcc aacttatgaa gaacgtttaa cctggtcacc tggcgatgga 480
catggtttgc gaacgcatca actaggtgct ttcactgttg gcggcttgaa ttgttgggaa 540
aactggatgc cattgccacg cgccgcattg tacgcacaag gtgaagactt tcatgtcgca 600
atctggccgg gaagcattca caatacgcaa gatattacgc gctatattgc gaaggaatcc 660
agatcgtttg taatgtctac ttccgggttc atgcgtaaag aagactttcc atctgatacg 720
cctcacttag acaaaatact tgcaaactct ccgaatgtac ttgcaaatgg tggatcctgt 780
ctcgccggac cggacggtca gtggatcgtt gaaccatttg tgaatgagga aaaattagtt 840
gttgcgactg tgaaccacaa gcaggtccgt gaagaaagac aaaattttga tccggtcgga 900
cactattccc gtcctgatgt cacgcagttg attgtgaaca gacaaagaca atcaacaatc 960
aagctcaacg attaa 975



<210> 328 
<211> 324 
<212> PRT 
<213> Unknown
<220> 

<223> Obtained from an environmental sample 
<400> 328 

Met Val Asp Gln Ile Leu Asn Asp Arg Ser Glu Leu Leu Thr Val Gly
1 5 10 15

Leu Ala Gln Ile Ala Pro Ile Trp Leu Asn Arg Glu Lys Thr Leu Ala
20 25 30

Lys Val Val Glu Lys Val Asn Gln Ala Ala Lys Gln Asp Cys His Leu
35 40 45

Val Ala Phe Gly Glu Ala Leu Val Pro Gly Tyr Pro Phe Trp Ile Glu
50 55 60

Leu Thr Asp Gly Ala Arg Phe Asn Ser Asn Val Gln Lys Glu Ile His
65 70 75 80

Ala His Tyr Met Asp Gln Ala Val Gln Ile Glu Asn Gly His Leu Lys
85 90 95

Ala Leu Cys Glu Thr Ser Ala Ala Asn Lys Ile Ala Val Ile Val Gly
100 105 110

Cys Ile Glu Arg Ala Ala Asp Arg Gly Gly His Ser Leu Tyr Ala Ser
115 120 125

Leu Val Phe Ile Asn Pro Gln Gly Gln Ile Gly Ser Val His Arg Lys
130 135 140

Leu Met Pro Thr Tyr Glu Glu Arg Leu Thr Trp Ser Pro Gly Asp Gly
145 150 155 160

His Gly Leu Arg Thr His Gln Leu Gly Ala Phe Thr Val Gly Gly Leu
165 170 175

Asn Cys Trp Glu Asn Trp Met Pro Leu Pro Arg Ala Ala Leu Tyr Ala
180 185 190

Gln Gly Glu Asp Phe His Val Ala Ile Trp Pro Gly Ser Ile His Asn
195 200 205

Thr Gln Asp Ile Thr Arg Tyr Ile Ala Lys Glu Ser Arg Ser Phe Val
210 215 220

Met Ser Thr Ser Gly Phe Met Arg Lys Glu Asp Phe Pro Ser Asp Thr
225 230 235 240

Pro His Leu Asp Lys Ile Leu Ala Asn Ser Pro Asn Val Leu Ala Asn
245 250 255

Gly Gly Ser Cys Leu Ala Gly Pro Asp Gly Gln Trp Ile Val Glu Pro
260 265 270

Phe Val Asn Glu Glu Lys Leu Val Val Ala Thr Val Asn His Lys Gln
275 280 285

Val Arg Glu Glu Arg Gln Asn Phe Asp Pro Val Gly His Tyr Ser Arg
290 295 300

Pro Asp Val Thr Gln Leu Ile Val Asn Arg Gln Arg Gln Ser Thr Ile
305 310 315 320

Lys Leu Asn Asp



<210> 329 
<211> 1023 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 329 

gtgccggcca aagtacgcgc cgccgccgtg cagctcagcc ccgtgctctt cagccgtgaa 60
ggcacgacca gcaaggtctg cgacaagatt gccgaggcgg cggcgcaggg cgccgagctg 120
gtggtgtttc ccgaaaccgt agtgccgtat tacccgtatt tctcgttcat caaggctccg 180
gccgtgatcg gcgccgagca cttactcttg ctcgaacaag ccgtcacggt gcccgggccc 240
agcgtcgaag ccatcgcaga agccgctcgc aaggcgggcg cggtggtttc gatcggcgtc 300
aacgaacgcg atcacggcac gctgtacaac acccagctgt tgttcgacgc cgacggtcgg 360
ttggcgcaag cccgccgcaa gatcaccccc acgtatcacg agcggatgat ctgggggcag 420
ggcgatggct cgggcttggt ggcggtggat acgcgagtcg gcaggattgg ctccctggcg 480
tgctgggagc actacaaccc gctggctcgc tatgcgctga tggccgacca cgagcaaatt 540
cacgtggcca tgttccctgg ctcgctcgtg ggcgacatct tccgcgagca aatcgaggtc 600
acgattcggc accacgcgct cgagtcgggc tgcttcgtcg tcaacgcgac gggctacctg 660
agcgacgcgc aggtgacgca gatcgcgggc gacaccaagc tcgaccgcgc cctgcgcggc 720
ggttgcttca ccgccatcgt atcgcccgag ggcacgctgc tggcgccacc gctcaccgac 780
ggtgagggca tggtcattgc cgacctcgat ctgtcgctca tcgccaaacg caaacgcatg 840
atggacagcg tcggccatta cagccggccc gagctgctca gcgtgctgat cgaccgctcg 900
ccgcagccgc atttgcgcga gaaaactgcc tctttacccg aacctcggat gtcccatgaa 960
tcgctcgctc ccgatagtaa cggcctccgc gacgctgacg cgaaagcctc gaccctctcc 1020
tga 1023

<210> 330
<211> 340
<212> PRT
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 330

Val Pro Ala Lys Val Arg Ala Ala Ala Val Gln Leu Ser Pro Val Leu
1 5 10 15

Phe Ser Arg Glu Gly Thr Thr Ser Lys Val Cys Asp Lys Ile Ala Glu
20 25 30

Ala Ala Ala Gln Gly Ala Glu Leu Val Val Phe Pro Glu Thr Val Val
35 40 45

Pro Tyr Tyr Pro Tyr Phe Ser Phe Ile Lys Ala Pro Ala Val Ile Gly
50 55 60

Ala Glu His Leu Leu Leu Leu Glu Gln Ala Val Thr Val Pro Gly Pro
65 70 75 80

Ser Val Glu Ala Ile Ala Glu Ala Ala Arg Lys Ala Gly Ala Val Val
85 90 95

Ser Ile Gly Val Asn Glu Arg Asp His Gly Thr Leu Tyr Asn Thr Gln
100 105 110

Leu Leu Phe Asp Ala Asp Gly Arg Leu Ala Gln Ala Arg Arg Lys Ile
115 120 125

Thr Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp Gly Ser
130 135 140

Gly Leu Val Ala Val Asp Thr Arg Val Gly Arg Ile Gly Ser Leu Ala
145 150 155 160

Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Asp
165 170 175

His Glu Gln Ile His Val Ala Met Phe Pro Gly Ser Leu Val Gly Asp
180 185 190

Ile Phe Arg Glu Gln Ile Glu Val Thr Ile Arg His His Ala Leu Glu
195 200 205

Ser Gly Cys Phe Val Val Asn Ala Thr Gly Tyr Leu Ser Asp Ala Gln
210 215 220

Val Thr Gln Ile Ala Gly Asp Thr Lys Leu Asp Arg Ala Leu Arg Gly
225 230 235 240

Gly Cys Phe Thr Ala Ile Val Ser Pro Glu Gly Thr Leu Leu Ala Pro
245 250 255

Pro Leu Thr Asp Gly Glu Gly Met Val Ile Ala Asp Leu Asp Leu Ser
260 265 270

Leu Ile Ala Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr Ser
275 280 285

Arg Pro Glu Leu Leu Ser Val Leu Ile Asp Arg Ser Pro Gln Pro His
290 295 300

Leu Arg Glu Lys Thr Ala Ser Leu Pro Glu Pro Arg Met Ser His Glu
305 310 315 320

Ser Leu Ala Pro Asp Ser Asn Gly Leu Arg Asp Ala Asp Ala Lys Ala
325 330 335

Ser Thr Leu Ser
340

<210> 331
<211> 1041
<212> DNA
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 331

atgtccggaa caggacctat cggaccggtc gtgaaggtcg ccgtcgtaca agccgcgcca 60
tgtctgctcg atcttgatgc gggcgtcgag aaggcgatcg ccttgatcga ccaagcgggc 120
aaggctggcg cgcggctcat caattttccc gaaatttggc tgccgggtta tccttggtgg 180
atctggctga atccacccgc catcaacatg cagtatgtcg cgccttacat gaacaactcc 240
attgtcgcag gcagcaagca tgaccacgca cttcgggccg ccgcgcgccg caacaatatt 300
cacgtcgtga tcggtgtctc cgagcgcgcc ggcggcagtc tgtacatggc tcagtggcac 360
tatgggcccg agggcgaggt gatctcgcgt cgtcgtaagc taaaacccac ccatgtcgaa 420
cgcagcgtct ttggcgaagg cgacggcagc gacatgatcg ttactcagac cgatttcggg 480
cgcgtcggcg cgctatgctg ttgggaacac ctgcaaccgc tgtcgaaata cgcgctcttt 540
tcccaggacg agcagattca ttgcgcggcg tggcccgctt tcagcctcta tgcaaaactc 600
tcgaaggcct tcagccccga agtcagcgtc aacgtgaacc aaatctacgc cgtagaaggg 660
caatgcttcg tcctgtcgtc gtgctcggtt atcgatcagg cgatctacga cacgttggtg 720
cagaacgaat tgcaccagaa gttcctcgag gtgggcgggg gctacagccg aatcttcggg 780
ccgaacggtg cggaattcgg tgagaatctc ccacccgata gggaaggcct ggtggttgcc 840
gatatcgatc tcggcctgat ctcgcactcc aagagtgccg ctgatccggc tggtcactac 900
gcgagggccg acgctctggc gctcatgcat aaccgtaatc cccgccgtcc ggttatcggt 960
ttcggcgagg cgacgcgcaa ggttgccgac gcactgccta aaggcgcgga acccgcggaa 1020
gcgctcgaag cggccgagtg a 1041



<210> 332 
<211> 346 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 332 



Met Ser Gly Thr Gly Pro Ile Gly Pro Val Val Lys Val Ala Val Val
1 5 10 15

Gln Ala Ala Pro Cys Leu Leu Asp Leu Asp Ala Gly Val Glu Lys Ala
20 25 30

Ile Ala Leu Ile Asp Gln Ala Gly Lys Ala Gly Ala Arg Leu Ile Asn
35 40 45

Phe Pro Glu Ile Trp Leu Pro Gly Tyr Pro Trp Trp Ile Trp Leu Asn
50 55 60

Pro Pro Ala Ile Asn Met Gln Tyr Val Ala Pro Tyr Met Asn Asn Ser
65 70 75 80

Ile Val Ala Gly Ser Lys His Asp His Ala Leu Arg Ala Ala Ala Arg
85 90 95

Arg Asn Asn Ile His Val Val Ile Gly Val Ser Glu Arg Ala Gly Gly
100 105 110

Ser Leu Tyr Met Ala Gln Trp His Tyr Gly Pro Glu Gly Glu Val Ile
<213> Unknown
<220> 

<223> Obtained from an environmental sample 
<400> 334 

Met Ala Ile Glu His Pro Arg Tyr Arg Val Ala Ala Val Gln Ala Ala
1 5 10 15

Pro Glu Phe Leu Asn Leu Glu Ala Thr Val Asp Lys Thr Ile Ala Leu
20 25 30

Ile Glu Glu Ala Ala Arg Gly Gly Ala Ser Leu Ile Ala Phe Pro Glu
35 40 45

Thr Trp Ile Pro Gly Tyr Pro Trp Phe Ala Trp Leu Gly Ala Pro Ile
50 55 60

Trp Gly Met Lys Phe Ile Gln Ala Tyr His Asp Asn Ser Met Val Ile
65 70 75 80

Asp Gly Ala Gln Phe Glu Arg Ile Ala Gln Ala Ala Ser Arg Cys Asn
85 90 95

Ile Thr Val Val Leu Gly Phe Ser Glu Lys Asp Ala Gly Ser Leu Tyr
100 105 110

Ile Ala Gln Ala Ile Leu Ser Pro Glu Gly Lys Thr Ile Ala Thr Arg
115 120 125

Arg Lys Leu Lys Pro Thr His Val Glu Arg Ala Ile Phe Gly Glu Gly
130 135 140

Asp Gly Ser Asp Leu Ala Val His Asp Thr Lys Leu Gly Arg Val Gly
145 150 155 160

Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr Ala Met
165 170 175

Tyr Ala Gln Asn Glu Gln Val His Ile Ala Ala Trp Pro Ser Phe Ser
180 185 190

Leu Tyr Val Asp Ala Ala Tyr Ala Leu Gly Pro Glu Val Asn Asn Ala
195 200 205

Ala Ser Arg Leu Tyr Ala Val Glu Gly Gln Cys Phe Val Val Ala Pro
210 215 220

Cys Ala Thr Val Ser Gln Lys Met Ile Asp Met Leu Cys Glu Thr Pro
225 230 235 240

Glu Gln Gln Ala Leu Leu Lys Pro Gly Gly Gly His Ala Gln Ile Tyr
245 250 255

Gly Pro Asp Gly Arg Ser Leu Ala Asp Pro Leu Pro Pro Asp Ala Glu
260 265 270

Gly Leu Leu Tyr Ala Asp Ile Asp Leu Ala Ala Ile Thr Leu Ala Lys
275 280 285

Ala Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val Thr Gln
290 295 300

Leu Leu Leu Asp Arg Asn Pro Lys Pro Arg Val Val His Ala Lys Pro
305 310 315 320

Gly Gln Ser Ala Asn Asn Ser Ser Pro Gly Met Arg Ala Val Glu His
325 330 335

Thr Glu Leu Glu Glu Gly Glu Gln Ala
340 345

<210> 335 
<211> 1053 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 335 

atgactacgc gcgtgattaa agtcgctgcc gcgcagctgt ctccggtgct ggcaactgca 60
tcggagcaca gccgcgagga cacgattgcg aaagtgatcg acgccatcgc tgcggcgtcg 120
caacagggcg cgcaactgat tgtgtttccg gaaacagtcg ttccgtatta cccgtatttt 180
tcatttatca cgcccgcggt aacgatgggt gccgaacatt tgaaattgta cgaacaggca 240
gtgacggtac ccagcgcagc gacagatgct gtcgctgcgg cggcaaaaaa ttatggcatg 300
gtagtggtgc tcggaattaa tgaacgcgat cacggctcgc tgtacaacgc gcaattaatt 360
ttcgacgccg atggtgagct gctattaaag cgtcgaaaaa ttacgccgac ttatcacgaa 420
cgcatggtgt ggggacaagg cgacggcagc ggtttaaaag ttgtcgatac tgctgccggc 480
cgtgtcggcg cgctcgcgtg ctgggaacat tacaacccgc tcgcgcgtta cagcctgatg 540
gcacaacacg aagaaattca ttgcagtcaa tttccgggat cgttggtcgg ccctattttc 600
gccgaacaaa tggaaatcac catgcgtcat cacgcactcg aatccggctg ttttgtggtg 660
aatgcaacgg cctggcttag cgatacacaa atccaatcaa ttacccccga taaagccatg 720
cagaaagcac tgcgcggcgg ttgctacacc gcaatcatct cgcccgaagg caaacatctg 780
tgcccaccgc tgtatgacgg agaaggaata attgtggcgg aattggactt cgcgttaatc 840
accaaacgta aacgcatgat ggattccgtc ggccattacg cgcgaccaga actactttct 900
ttgctcctcg atgatcgcgt aactgcgccg ctcaaaaact tacagacgac gatggcctct 960
gccaaatccg ctgaagatgg ttttccttta tttgcagacg ttttatatcc agacagttct 1020
ttcattgaga cgtcgaaatt cgcggagtca tga 1053



<210> 336 
<211> 350 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 336 
Met Thr Thr Arg Val Ile Lys Val Ala Ala Ala Gln Leu Ser Pro Val
1 5 10 15

Leu Ala Thr Ala Ser Glu His Ser Arg Glu Asp Thr Ile Ala Lys Val
20 25 30

Ile Asp Ala Ile Ala Ala Ala Ser Gln Gln Gly Ala Gln Leu Ile Val
35 40 45

Phe Pro Glu Thr Val Val Pro Tyr Tyr Pro Tyr Phe Ser Phe Ile Thr
50 55 60

Pro Ala Val Thr Met Gly Ala Glu His Leu Lys Leu Tyr Glu Gln Ala
65 70 75 80

Val Thr Val Pro Ser Ala Ala Thr Asp Ala Val Ala Ala Ala Ala Lys
85 90 95

Asn Tyr Gly Met Val Val Val Leu Gly Ile Asn Glu Arg Asp His Gly
100 105 110

Ser Leu Tyr Asn Ala Gln Leu Ile Phe Asp Ala Asp Gly Glu Leu Leu
115 120 125

Leu Lys Arg Arg Lys Ile Thr Pro Thr Tyr His Glu Arg Met Val Trp
130 135 140

Gly Gln Gly Asp Gly Ser Gly Leu Lys Val Val Asp Thr Ala Ala Gly
145 150 155 160

Arg Val Gly Ala Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg
165 170 175

Tyr Ser Leu Met Ala Gln His Glu Glu Ile His Cys Ser Gln Phe Pro
180 185 190

Gly Ser Leu Val Gly Pro Ile Phe Ala Glu Gln Met Glu Ile Thr Met
195 200 205

Arg His His Ala Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Ala
210 215 220

Trp Leu Ser Asp Thr Gln Ile Gln Ser Ile Thr Pro Asp Lys Ala Met
225 230 235 240

Gln Lys Ala Leu Arg Gly Gly Cys Tyr Thr Ala Ile Ile Ser Pro Glu
245 250 255

Gly Lys His Leu Cys Pro Pro Leu Tyr Asp Gly Glu Gly Ile Ile Val
260 265 270

Ala Glu Leu Asp Phe Ala Leu Ile Thr Lys Arg Lys Arg Met Met Asp
275 280 285

Ser Val Gly His Tyr Ala Arg Pro Glu Leu Leu Ser Leu Leu Leu Asp
290 295 300

Asp Arg Val Thr Ala Pro Leu Lys Asn Leu Gln Thr Thr Met Ala Ser
305 310 315 320

Ala Lys Ser Ala Glu Asp Gly Phe Pro Leu Phe Ala Asp Val Leu Tyr
325 330 335

Pro Asp Ser Ser Phe Ile Glu Thr Ser Lys Phe Ala Glu Ser
340 345 350







<210> 337 
<211> 957 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 337 

atgaagggag tgtgtgccat gtcgacgaaa gttgccatcg tccaggcgcc gccggtcctg 60
ctgcatcgcg acaggacgat tgcgaaggtg cgttcgtcga tcgaggatgc cgccaatgcg 120
ggcgcctcgc tgatcgtatt tcccgaagct tatgtacccg gctatccgag ttggatctgg 180
cgtctcaggc ccggaggcga catgggactg tcgtctgaga ttcacgcaag attgcgggaa 240
aatgccgttg atctcgcgaa cggaggcctg gcgcatgtcc agggggctgc agcaaaattc 300
ggcgcgactg tcgttatcgg catcaatgaa ctcgacagcg agttcagcgg aacgacattg 360
ttcaacaccg tggtggtcat cggccccgac ggaacgcgcc tcaacaggca tcgaaaatta 420
atgccgacca acccggagcg catggtgtgg ggcacgggcg atgcctcggg tctgcgtgtc 480
atcgatacgc cggcgggacg gctgggaacc atgatctgct gggagagcta catgccgctg 540
gcgcgctatg ctctctatgc gcaaggcatc gagatatacg tcgctcccac gtgggacgca 600
ggcgagagct ggattgctac gatgcgccac atcgccaagg aggccggctg ctgggtgatc 660
ggcacggcaa ccgtcatcca gggcagcgat gttccggacg attttcccga acgcgacaag 720
ctcttcaagc cggaggagtg gatcaacgac ggcgatgcgg tcgtggtcaa gcccatgggc 780
gcgattgctg ccggaccgca caatcgacag aaaagcatac tctacgccga catcgaccgg 840
gaggccgcgc ggcgagcccg ccggtcgctc gatgtctgtg gccactattc ccgcccagac 900
gttttctctt tctcggtcaa ccgaaagcca ttccgccctg ccgactttgt gggttga 957



<210> 338 
<211> 313 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 338

Met Lys Gly Val Cys Ala Met Ser Thr Lys Val Ala Ile Val Gln Ala
1 5 10 15

Pro Pro Val Leu Leu His Arg Asp Gly Arg Leu Arg Arg Cys Val Arg
20 25 30

Arg Ser Arg Met Pro Pro Met Arg Ala Pro Arg Ser Tyr Phe Pro Ala
35 40 45

Tyr Val Pro Gly Tyr Pro Ser Trp Ile Trp Arg Leu Arg Pro Gly Gly
50 55 60

Asp Met Gly Leu Ser Ser Glu Ile Thr Gln Asp Cys Gly Lys Met Pro
65 70 75 80

Leu Ile Ser Arg Thr Glu Ala Trp Arg Met Ser Arg Gly Leu Gln Gln
85 90 95

Asn Ser Ala Arg Leu Ser Leu Ser Ala Ser Met Asn Ser Thr Ala Ser
100 105 110

Ser Ala Glu Arg His Cys Ser Thr Val Val Val Ile Gly Pro Asp Gly
115 120 125

Thr Arg Leu Asn Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu His
130 135 140

Gly Val Gly His Gly Arg Cys Leu Gly Ser Ala Cys His Arg Tyr Ala
145 150 155 160

Gly Gly Thr Ala Gly Asn His Ile Cys Trp Glu Ser Tyr Met Pro Leu
165 170 175

Ala Arg Tyr Ala Leu Tyr Ala Gln Gly Ile Glu Ile Tyr Val Ala Pro
180 185 190

Thr Trp Asp Ala Gly Glu Ser Trp Ile Ala Thr Met Arg His Ile Ala
195 200 205

Lys Glu Ala Gly Cys Trp Val Ile Gly Thr Ala Thr Val Ile Gln Gly
210 215 220

Ser Asp Val Pro Asp Asp Phe Pro Glu Arg Asp Lys Leu Phe Lys Pro
225 230 235 240

Arg Ser Gly Ser Thr Thr Ala Met Arg Ser Trp Ser Ser Pro Trp Ala
245 250 255

Arg Leu Leu Pro Asp Arg Thr Ile Asp Arg Lys Ala Tyr Ser Thr Pro
260 265 270

Thr Ser Thr Gly Arg Pro Arg Gly Glu Pro Ala Gly Arg Ser Met Cys
275 280 285

Gly His Tyr Ser Arg Pro Asp Val Phe Ser Phe Ser Val Asn Arg Lys
290 295 300

Pro Phe Arg Pro Ala Asp Phe Val Gly
305 310



<210> 339 
<211> 1020 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 339 

atgtcaggat cattcaaagt tgcagcagtg caggcggctc cggccttcct caatctcgat 60
gcgggcatcg acaaggcggt ggcgctgatc gagcaagccg ccgcgcagga cgttcagctc 120
atcgcctttc ccgagacctg gttgcccgga tacccctggt ggatctggct cgatgcaccc 180
gccgttacca tgggctatgt cgttccctac aatctcaatt cactagaggc gggcagcccg 240
caggacaagc gtctggcaaa tgccgcgcgt gagaacaaca tccaggtggt gatgggtctg 300
tctgaacgcc atgacggcac gctctacatc gcgcagtggc actatggtga agatggcgag 360
gtgatctcgc gacggcgcaa gctcaagccg acccatgtcg aacgcacggt gttcggggaa 420
ggcgacggca gcgacatggt ggtcaaggac acaagtctgg gacgggtcgg cgctctgtgc 480
tgttgggaac acctgcagcc gctcaacaaa tacgcgatgt actcccagaa cgagcagatc 540
cacatcggtt cctggcccag cttcagcctc tacaagggcg gcgcctatgc gcttggggca 600
gacctcaaca cggcggcaag ccagatgtat gcggccgagg gccagtgctt tgttctggct 660
gcctgcgcca cggtcagtca ggacatgttc gacatgctct gcgacacgga aatgaagcag 720
cagttcctga ccaccggtgg cggattcgcc cgcattttcg gacctgacgg ctcgcccatg 780
ggcaatgtgc ttgaagaaca tgaagaaggg ctggtgatcg ccgaaatcga tctcaccatg 840
attgcgatcg ccaaggcggc ggctgacccg tgcgggcact attcccggcc cgatgtgttc 900
cgactgatgt tcaaccagaa gccaagcccg gtggtgatgc cattcgaaaa cgacgtggcc 960
cgcgagattg tcgaggccgc cgaggatcag gtcagcgcaa accggctcgc agcggaatga 1020



<210> 340 
<211> 329 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
<400> 340
Met Ser Gly Ser Phe Lys Val Ala Ala Val Gln Ala Ala Pro Ala Phe
1 5 10 15

Leu Asn Leu Asp Ala Gly Ile Asp Gly Gly Gly Ala Asp Arg Ala Ser
20 25 30

Arg Arg Ala Gly Arg Ser Ala His Arg Leu Ser Arg Asp Leu Val Ala
35 40 45

Gly Tyr Pro Trp Trp Ile Trp Leu Asp Ala Pro Ala Val Thr Met Gly
50 55 60

Tyr Val Val Pro Tyr Asn Leu Asn Ser Arg Gly Gly Gln Pro Ala Gly
65 70 75 80

Gln Ala Ser Gly Lys Cys Arg Ala Glu Gln His Pro Gly Gly Asp Gly
85 90 95

Leu Ser Glu Arg His Asp Gly Thr Leu Tyr Ile Ala Gln Trp His Tyr
100 105 110

Gly Glu Asp Gly Glu Val Ile Ala Thr Ala Gln Ala Gln Ala Asp Pro
115 120 125

Cys Arg Thr His Gly Val Arg Gly Arg Arg Arg Gln Arg His Val Val
130 135 140

Lys Asp Thr Ser Leu Gly Arg Val Gly Ala Leu Cys Cys Trp Glu His
145 150 155 160

Leu Gln Pro Leu Asn Lys Thr Arg Cys Thr Pro Arg Thr Ser Arg Ser
165 170 175

Thr Ser Val Pro Gly Pro Ala Ser Ala Ser Thr Arg Ala Ala Leu Cys
180 185 190

Ala Trp Gly Arg Pro Gln His Gly Gly Lys Pro Asp Val Cys Gly Arg
195 200 205

Gly Pro Val Leu Cys Ser Leu Pro Ala Pro Arg Ser Val Arg Thr Cys
210 215 220

Ser Thr Cys Ser Ala Thr Arg Lys Ser Ser Ser Ser Thr Gly Gly Gly
225 230 235 240

Phe Ala Arg Ile Phe Gly Pro Asp Gly Ser Pro Met Gly Asn Val Leu
245 250 255

Glu Glu His Glu Arg Ala Gly Asp Arg Arg Asn Arg Ser His His Asp
260 265 270

Cys Asp Arg Gln Gly Gly Gly Pro Val Arg Ala Leu Phe Pro Ala Arg
275 280 285

Cys Val Pro Thr Asp Val Gln Pro Glu Ala Lys Pro Gly Gly Asp Ala
290 295 300

Ile Arg Lys Thr Trp Pro Ala Arg Leu Ser Arg Pro Pro Arg Ile Arg
305 310 315 320

Ser Ala Gln Thr Gly Ser Gln Arg Asn
325
320 



<210> 341 
<211> 1056 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 341

atggcactga ccaacccaaa atataaagtc gccgccgtcc aggccgcgcc agcgttcctc 60
gatctggatg cgtccgtcga aaaagcggtc cggctgatcg acgaggccgg cgccaagggc 120
gcgcgcctca ttgcgttccc ggaaacctgg atccccggct atccctggtg gatctggctc 180
ggcgcgccgg cctgggcgat catgaaaggt tttgtctcgg cctatttcga caattcgctc 240
acctatgaca gtccggccgc ggacaaattg cgccaggccg ccaagcgcaa cgatatcgtc 300
gtggtgctcg gcctgtcgga gcgcgacggc ggcagtctct atatcgcgca atggatcatc 360
ggcccggacg gcgaaactgt cgcccagcgc cgcaagctca agccgaccca tgtcgagcgt 420
tcggtgttcg gcgagggcga tggcagcgac cttgccgtgc atgagctcgc gatcgggcgc 480
gtcggcgcgc tgtgctgctg ggagcatctg caaccgctgt cgaaatacgc catgtatgcg 540
cagaacgagc aggttcacgt ggcggcctgg ccgagctttt cgctgtacga tccgttcgcc 600
catgccctcg gtgcggaggt caacaacgcc gccagcaaga tctacgcggt cgagggatcg 660
tgcttcgtcg tggcaccctg cgccaccgtt tcaaaggaaa tgatcgacct gttgtgcgac 720
acgccggaca agcacggtct gctgcacgcc ggcggcgggt ttgccgcgat ctacggccct 780
gacggctcgc cgatcggcga ccgcctggcg cccgaccagg aaggtctgat ctatgccgat 840
gtcgatctcg gcatgatctc ggtcgcgaag gccgccgccg atccggccgg acattatgcg 900
cggccggatg tcacaaggtt gctgctcaac aaacgcccgg gcaatcgtgt cgaggcgctg 960
gcacttccgg tggatcaggt tgcggcaggt gaggagatcc cctcgatatc gcgatcggcc 1020
agaggggttg ccgaactgcc aaacgcggcc gaatag 1056

<210> 342 
<211> 342 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 342 

Met Ala Leu Thr Asn Pro Lys Tyr Lys Val Ala Ala Val Gln Ala Ala
1 5 10 15

Pro Ala Phe Leu Asp Leu Asp Ala Pro Ser Lys Lys Arg Ser Gly Ser
20 25 30

Thr Arg Pro Ala Pro Arg Ala Arg Ala Ser Leu Arg Ser Arg Thr Trp
35 40 45

Ile Pro Gly Tyr Pro Trp Trp Ile Trp Leu Gly Ala Pro Ala Trp Ala
50 55 60

Ile Met Lys Gly Phe Val Ser Pro Ile Ser Thr Ile Arg Ser Pro Met
65 70 75 80

Thr Val Arg Pro Arg Thr Asn Cys Ala Arg Pro Pro Ser Ala Thr Tyr
85 90 95

Arg Arg Gly Ala Arg Pro Val Gly Ala Arg Arg Arg Gln Ser Leu Tyr
100 105 110

Arg Ala Met Asp His Arg Pro Thr Ala Lys Leu Ser Pro Ser Ala Ala
115 120 125

Ser Ser Ser Arg Pro Met Ser Ser Val Arg Cys Ser Ala Arg Ala Met
130 135 140

Ala Ala Thr Leu Pro Cys Met Ser Ser Arg Ser Gly Ala Ser Ala Arg
145 150 155 160

Cys Ala Ala Gly Ser Ile Cys Thr Ala Val Glu Ile Arg His Val Cys
165 170 175

Ala Glu Arg Ala Gly Ser Arg Gly Gly Leu Ala Glu Leu Phe Ala Thr
180 185 190

Ile Arg Ser Pro Met Pro Ser Val Arg Arg Ser Thr Thr Pro Pro Ala
195 200 205

Arg Ser Thr Arg Ser Arg Asp Arg Ala Ser Ser Trp His Pro Ala Pro
210 215 220

Pro Phe Gln Arg Lys Ser Thr Cys Cys Ala Thr Arg Arg Thr His Gly
225 230 235 240

Leu Leu His Ala Gly Gly Gly Phe Ala Ala Ile Tyr Gly Pro Asp Gly
245 250 255

Ser Pro Ile Gly Asp Arg Trp Arg Pro Thr Arg Lys Val Ser Met Pro
260 265 270

Met Ser Ile Ser Ala Ser Arg Ser Arg Arg Pro Pro Asp Pro Ala Gly
275 280 285

His Tyr Ala Arg Pro Asp Val Thr Arg Leu Leu Leu Asn Lys Arg Pro
290 295 300

Gly Asn Arg Val Arg Arg Trp His Phe Arg Trp Ile Arg Leu Arg Gln
305 310 315 320

Val Arg Arg Ser Pro Arg Tyr Arg Asp Arg Pro Glu Gly Cys Arg Thr
325 330 335

Ala Lys Arg Gly Arg Ile
340

<210> 343 
<211> 942 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 343 

atgaagtcaa aaattgcggt tattcagcga cctcctgtat tgttagacct tcaggcttca 60
atcgcccgtg ccattacttc tgttgaggaa gcggcaggca agggatctga gctccttgtt 120
tttcccgaga catttctgcc tggttatccg tcctggatct ggcgtctcaa gccgggcgga 180
gacatggtgc tgacatctga aatccacgca aaatatcgcg cgaactctgt tgatgttgag 240
cgcggggatc tggccccttt atgcgaagcg gcggcgaaac acggcgtcac aattgtcatg 300
gggctcagtg aaattgatgg gcgctacagc gggactacac tctttaatac agtggtgacc 360
attggcgcgg aaggagagct ccttaataga caccgcaagc tcatgccgac aaacccagag 420
cgtatggtct gggggcaagg ggatgcctct ggtctgcggg ttgtcgacac gcccgtgggc 480
cgcgtcggca cgctgatctg ctgggaaaac tacatgccgc tatcgcgcta tgcgctttat 540
tctcaaaaca ttgacatcta tgtggcgccg acctgggacg cgggcgagag ctggatcgcc 600
tccatgcagc atatcgccaa agaaggtggc tgctgggtga tcggcacggc cacggcgatg 660
gagggctctg atgtcccagc cgacttccct cagcgggagg tgcttttccc tgatagcagc 720
gaatggatca atgacggtga cgctgtagtg gttaaaccca tgggggcgat tgtcgcgggt 780
ccgcatcacc gggataagag tattctctat gctgagattg acgtcgaagt ggcacgcaat 840
gcgcggcgct cgctcgatgt ggcggggcat tactcccggc cggatatttt ttcctttggc 900
gtggatcgcc ggcctttgcc gccggttacg tttgaggatt ga 942

<210> 344 

<211> 303 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



50 

Leu Lys
65 



130 
Val Cys 
145 



Ser 


Lys 


lie 


Ala 


Val 


He 


Gin 


Arg 


Pro 


Pro 


Val 


Leu 


Leu 


Asp 




5 










10 










15 




Ala 


Ser 


lie 


Ala 


Arg 


Ala 


He 


Thr 


Leu 


Leu 


Arg 


Lys 


Arg 


Gin 


20 








25 










30 




Val 


Asp 


Leu 


Ser 


Ser 


Leu 


Phe 


Phe 


Pro 


Arg 


His 


Phe 


Cys 


Leu 


35 










40 










45 






His 


Pro 


Gly 


Ser 


Gly Val 


Ser 


Ser 


Arg 


Ala 


Glu


Thr 


Trp 


Cys 








55 










60 










Ser 


Thr 


Gin 


Asn 


He 


Ala 


Arg 


Thr 


Leu 


Asp 


Val 


Glu 


Arg 


Gly 






70 








75 










80


Ala 


Pro 


Leu 


Cys 


Glu 


Ala 


Ala 


Ala 


Lys 


His 


Gly Val 


Thr 


He 






85 








90 










95 




Gly 


Gin 


Asn 


Trp 


Ala 


Leu 


Gin 


Arg 


Asp 


Tyr 


Thr 


Leu 


Tyr 


Ser 


100 










105 










110 






His 


Trp 


Arg 


Gly Arg 


Arg 


Leu 


Leu 


Asn 


Arg 


His 


Arg 


Lys 


Leu 


115 










120 










125 








Thr 


Asn 


Pro 


Glu 


Arg 


Met 


Val 


Trp 


Gly 


Gin 


Gly Asp 


Ala 


Ser 










135 










140 










Gly 


Leu 


Ser 


Thr 


Arg 


Pro 


Trp 


Ala 


Ala 


Ser 


Ala 


Arg 


Ser 


Ala 






150 








155 










160 


Thr 


Thr 


Cys 


Arg 


Tyr 


Arg 


Tyr 


Ala 


Leu 


Tyr 


Ser 


Gin 


Asn 


He 






165 








170 










175 





60 
120 
180 
240 
300 
360 
420 
480 



780 
840 
900 
942 
Asp


Ile


Tyr 


Val 
180 


Ala 


Pro 


Thr 


Trp 


Asp 

185


Ala


Gly


Li_LU 


Ser 


Trp 
190

JL j u 


Tl 0 

Ile




Ser 


Met 


Gln
195 


His 


Ile


Ala 


Lys 


Glu 
200 


Gly 


Gly 


Cys 


Trp 


val 

205


Ile


Gly


Thr


Ala 


Thr 
210 


Ala 


Met 


Glu 


Gly 


Ser 
215 


Asp 


Ser 


CjJLn 


Pro 


rpl_ „ 

inr 
220 


Ser 


Leu 


Ser 




Arg 


Cys 


Phe 


Ser 


Leu 


Ile


Ala 


Ala 


Asn 


Gly


Ser 


iwr _ 4- 

Met 


Thr 


Val


x nr 


Leu 


225 








230 










235 










240 


Trp 


Leu 


Asn 


Pro 


Trp 


Gly 


Arg 


Leu 


Ser 


Arg 


Val


Arg 


x. Jie 


inr 


Gly


Trp








245 










250 










255 




Arg 


Val 


Phe 


Ser 


Met 


Leu 


Arg 


Leu 


Thr 


Arg 


Ser 


Gly 


Thr 


Gln


Cys 


Ala 






260 










265 










270 






Ala 


Leu 


Ala 
275 


Arg 


Cys 


Gly 


Gly 


Ala 
280 


Leu 


Leu 


Pro 


Ala 


Gly 
285 


Tyr 


Phe 


Phe 


Leu 


Trp 
290 


Val 


Asp 


Arg 


Arg 


Pro 
295 


Leu 


Pro 


Pro 


Val 


Thr 
300 


Phe 


Glu 


Asp 





<210> 345 
<211> 1011 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 345 

atgaaagcga tcaaggccgc tgccgtgcag gcagcaccgg tattcctgaa cctcgacgca 60
tcgatcacca aggcggaaac attcgtcgcc gaggccgccg cgaatggtgc caagctggtg 120
gcgtttccgg aaacctggct gccgggctat ccctggttca tctggctcgg tgcgcccgcc 180
gaaggcatgc agttcatccc gcgctatcac gaaaacagca tggagctccg ctcgcccgag 240
atgcgccgct tgcaggcgat cgcgcgcaag tatgaagtga cgctcgtcat gggctattcc 300
gagcgcgatg gtggcagccg ctacatgtcc caggtcatta ttggcgatca gggcgacatc 360
cttctcaatc gccgtaaatt gaagccaacc catgtcgagc ggacggtctt cggcgaaggc 420
gacggttcgg acctggtggt ggtcgaaacg gcattcggca ggctcggtgc gctcaattgc 480
tgggaacata tccagccgct cgtcaagatg tcgatgtatg cccagcatga ggaaatccat 540
gtcgcgggtt ggccgagctt ctgcgtctac cgcgatctcg cctatgccct gggaccggaa 600
gtcaacaatg ccgtcagtca ggtctatgcc gtggagggta gcgcctatgt tctggcaccc 660
tgtgcgatcg taagccagga gatgttcgac attctggccg acaagcctga aaaggccttt 720
ctcctcaatc cccgcacatc caagcccggc ggtggcttca cgcagatcta tgcgccggat 780
ggtcgaccgc tttgcgagcc gcttgccgac gatgtggaag gcatcctcta tgccgatctc 840
gatccggcaa cgatcgccgt cgcgaaggcg gccgccgatc ctgcggggca ctattcgcgg 900
ccggacgcac tctcgctggt gatcaatcgc gaaaagcgcg cggtgatggc tgaaatcaac 960
gcgccggcga cgccgacctt cacccccatc tccctggacg ctgcggagta g 1011



<210> 346 
<211> 329 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 346 

Met Lys Ala Ile Lys Ala Ala Ala Val Gln Ala Ala Pro Val Phe Leu
1 5 10 15

Asn Leu Asp Ala Ser Ile Thr Lys Ala Glu Thr Phe Val Ala Glu Ala
20 25 30

Ala Ala Asn Gly Ala Lys Leu Val Ala Phe Pro Glu Thr Trp Leu Pro
35 40 45

Gly Tyr Pro Trp Phe Ile Trp Leu Gly Ala Pro Ala Glu Gly Met Gln
50 55 60
<210> 348
<211> 297 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample
<400> 348 

Met Lys Val Ala Ala Ile Gln Ala Ala Pro Val Tyr Leu Asp Arg Gln
1 5 10 15

Ala Thr Leu Glu Lys Ala Leu Ser Trp Met Lys Arg Pro Gln Thr Ala
20 25 30

Pro Lys Ser Ala Pro Ser Leu Arg Pro Ser Ser Arg Ala Ile Pro Trp
35 40 45

Met Asp Leu Thr Asp Gly Ala Lys Trp Asn Asp Asp Lys Gln Lys Ala
50 55 60

Ala Tyr Ala Cys Tyr Val Asp Ala Ala Val Glu Ala Asp Gly Pro Glu
65 70 75 80

Leu Gln Ala Ile Ala Lys Lys Ser Lys Ala Leu Gly Leu Phe Thr Ile
85 90 95

Trp Ala Trp Ser Asn Ala Arg Arg Arg Pro Gly Gln Tyr Ile Val Pro
100 105 110

Ser Leu Pro Ser Ile Pro Thr Arg Val Ser Ser Ala Tyr Thr Glu Asn
115 120 125

Ser Cys Pro Pro Thr Pro Ser Ala Ser Cys Gly Ala Lys Ala Thr Asp
130 135 140

Thr Asp Ser Arg Cys Met Asn Ser Pro Ala Ser Lys Ser Ala Arg Thr
145 150 155 160

Ala Gly Lys Ile Gly Cys Pro Ala Arg Tyr Ala Met Tyr Ala Gln Gly
165 170 175

Glu Gln Leu His Val Ala Thr Trp Pro Gly Ser Pro Trp Leu Thr Lys
180 185 190

Asp Ile Thr Arg Phe Ile Ala Leu Glu Gly Arg Ile Tyr Val Met Ser
195 200 205

Val Gly Gly Val Leu Ser Ala Asn Asp Ile Pro Asp Ser Phe Pro Leu
210 215 220

Lys Thr Asp Leu Leu Lys Ile Arg Asp Arg Tyr Leu Ser Gly Gly Thr
225 230 235 240

Asp Ser Arg Pro Arg Arg His His Pro Arg Arg Pro Arg Gln Lys Arg
245 250 255

Arg Asp His Thr Leu Arg Gly Leu Asp Leu Asn Thr Val Leu Gln Glu
260 265 270

Arg Gln Asn Phe Asp Pro Ala Gly His Tyr Ala Arg Pro Asp Val Ser
275 280 285

Asn Trp Lys Ser Thr Lys Ile Asp Asp
290 295

<210> 349 
<211> 1002 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 349 

atgacgtccc ctgttcaaac caagtacaaa gtcgcctgtg ttcaggcggc gcccgagttt 60
ctcgatctcg acaaaggcgt tgccaaagcg gtgcgcctga tcgaagaagc cgccacccaa 120
aaggcctcgc tgatcgcgtt tcccgaagtc tggctccccg gctatccgtg gtggatctgg 180
ctcgactcgc cggcctgggg cttgcagttc gtccagcgct acttcgaaaa cgctctggtc 240
gtcggcagcc cccaatggga gcgcttgtgc aaggccgccg ccgacaacaa tatccatgtt 300
gtgctcggat tctccgaacg ggacggcagc acgctgtacc tcgcacaggc catcatcgat 360
aacaccggga aggtgatcgc cacgcggcgc aaactcaagc caacacacgc cgaacgcacg 420
gttttcggcg aaggcgacgg gagccacatc gcggtgcatg aaaccacttt gggccgcatg 480
ggtgcactct gctgcgccga gcacatccag ccactgacca agtacgccat gtactcgcag 540
cacgagcaga ttcacattgc cgcatggccc agcttctcgg tctaccgcgg agcagcgttc 600
cagctgagcg ccgaagccaa caacgccgcg agccaggtct atgccctgga gggcagttgc 660
tacgtggtgg ccccttgcgc gacggtgtcc aaggagatgt tggacatgct ggctgattcg 720
ccgcaaaaga agcagctcct gctggaaggc ggtggctacg ccatgattta tgggcccgac 780
gccaagcccc tgtgcgagcc cattccagag acagaagaag gcattcttta cgcagatgtg 840
gacctgggct tcatcggtgt caccaaggca gcgtatgacc ccgccggtca ctattcacgc 900
cccgacgtgc tgcgcctttt gttcaatcgg aagcctgccc ctcgggttca cgatttcgat 960
cctgaataca cggccaccga gcagaagaca gacgcggcct ga 1002

<210> 350
<211> 333
<212> PRT
<213> Unknown



<220>

<223> Obtained from an environmental sample
<400> 350

Met Thr Ser Pro Val Gln Thr Lys Tyr Lys Val Ala Cys Val Gln Ala
1 5 10 15

Ala Pro Glu Phe Leu Asp Leu Asp Lys Gly Val Ala Lys Ala Val Arg
20 25 30

Leu Ile Glu Glu Ala Ala Thr Gln Lys Ala Ser Leu Ile Ala Phe Pro
35 40 45

Glu Val Trp Leu Pro Gly Tyr Pro Trp Trp Ile Trp Leu Asp Ser Pro
50 55 60

Ala Trp Gly Leu Gln Phe Val Gln Arg Tyr Phe Glu Asn Ala Leu Val
65 70 75 80

Val Gly Ser Pro Gln Trp Glu Arg Leu Cys Lys Ala Ala Ala Asp Asn
85 90 95

Asn Ile His Val Val Leu Gly Phe Ser Glu Arg Asp Gly Ser Thr Leu
100 105 110

Tyr Leu Ala Gln Ala Ile Ile Asp Asn Thr Gly Lys Val Ile Ala Thr
115 120 125

Arg Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Val Phe Gly Glu
130 135 140

Gly Asp Gly Ser His Ile Ala Val His Glu Thr Thr Leu Gly Arg Met
145 150 155 160

Gly Ala Leu Cys Cys Ala Glu His Ile Gln Pro Leu Thr Lys Tyr Ala
165 170 175

Met Tyr Ser Gln His Glu Gln Ile His Ile Ala Ala Trp Pro Ser Phe
180 185 190

Ser Val Tyr Arg Gly Ala Ala Phe Gln Leu Ser Ala Glu Ala Asn Asn
195 200 205

Ala Ala Ser Gln Val Tyr Ala Leu Glu Gly Ser Cys Tyr Val Val Ala
210 215 220

Pro Cys Ala Thr Val Ser Lys Glu Met Leu Asp Met Leu Ala Asp Ser
225 230 235 240

Pro Gln Lys Lys Gln Leu Leu Leu Glu Gly Gly Gly Tyr Ala Met Ile
245 250 255

Tyr Gly Pro Asp Ala Lys Pro Leu Cys Glu Pro Ile Pro Glu Thr Glu
260 265 270

Glu Gly Ile Leu Tyr Ala Asp Val Asp Leu Gly Phe Ile Gly Val Thr
275 280 285

Lys Ala Ala Tyr Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val Leu
290 295 300

Arg Leu Leu Phe Asn Arg Lys Pro Ala Pro Arg Val His Asp Phe Asp
305 310 315 320

Pro Glu Tyr Thr Ala Thr Glu Gln Lys Thr Asp Ala Ala
325 330

<210> 351 
<211> 936 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 351 

atgattacag caggcatcgc agtcgccgct ccggtggtgt tggacaaaac aaaaaccatt 60
gagaaagccg ttggcattat tcacgaggca gcgggtaagg gtgtgaacct gcttgtgttt 120
cccgaggcat ttattccctc ctatcccgcc tggggttggc gcctgcgtcc cggtggagat 180
ttcgggttgt gcgaggagtt gcacgccctg ttgcttgata attcggtaaa tttgcaaggt 240
gatgacctgg accctgtccg gggcgctgca gccgagcatt caatgaccgt ggtgatggga 300
ttgaatgagc gcgaaggcca gttcggtcgg gctaccctgt ttaacgccat ggtatttatc 360
ggtccggacg gcagcatcct gaaccatcat cggaaactta tgccaaccaa tcatgagcgt 420
acgattcatg gcttcggcga tgcgcgggga ttgaaagtgg tggatacccc gtgcggtcgc 480
gtgggtggtc tgatttgctg ggagaatttc atgcccctgg ctcgctacgg cctgtatgcc 540
cagggcgtag aagtgtatgt tgcgcccacc tacgaccagg gtgatgggtg gataggatcc 600
atgcagcata ttgcccggga aggacggtgc tgggtgttat cggccggaac accgctacgc 660
ggcagtgatt ttcccgcgga catgccgggc aaggctcaac tgtttcccga tgacgatgaa 720
tgggtgaatc ccggtgggtc agtggttatc gcaccgggtg gggaattagt ggctggaccg 780
cttttccgtg aggagggcat ccttgtctgt gaattggatc cggcgaaaag tgctcatgcc 840
aagcggtcct ttgacgtggc cggtcattac gccaggccag atattttcga gttggaaata 900
gaccgtgatc cacaggatcc cgtcgagtgg gactga 936



<210> 352 
<211> 301 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample



<400> 352 

Met Ile Thr Ala Gly Ile Ala Val Ala Ala Pro Val Val Leu Asp Lys
1 5 10 15

Thr Lys Thr Ile Glu Lys Ala Val Ala Leu Phe Thr Arg Gln Arg Val
20 25 30

Arg Val Thr Cys Leu Cys Phe Pro Arg His Leu Phe Pro Pro Ile Pro
35 40 45

Pro Gly Val Gly Ala Cys Val Pro Val Glu Ile Ser Gly Cys Ala Arg
50 55 60

Ser Cys Thr Pro Cys Cys Leu Ile Ile Arg Asn Leu Gln Gly Asp Asp
65 70 75 80

Leu Asp Pro Val Arg Gly Ala Ala Ala Glu His Ser Met Thr Val Val
85 90 95

Met Gly Glu Ala Arg Arg Pro Val Arg Ser Gly Tyr Pro Val Arg His
100 105 110

Gly Ile Tyr Arg Ser Gly Arg Gln His Pro Glu Pro Ser Ser Glu Thr
115 120 125

Tyr Ala Asn Gln Ser Ala Tyr Asp Ser Trp Leu Arg Arg Cys Ala Gly
130 135 140

Ile Glu Ser Gly Gly Tyr Pro Val Arg Ser Arg Gly Trp Ser Asp Leu
145 150 155 160

Leu Gly Glu Phe His Ala Pro Gly Ser Leu Gly Leu Tyr Ala Gln Gly
165 170 175

Val Glu Val Tyr Val Ala Pro Thr Tyr Asp Gln Gly Asp Gly Trp Ile
180 185 190

Gly Ser Ala Ala Tyr Cys Pro Gly Arg Thr Val Leu Gly Val Ile Gly
195 200 205

Arg Asn Thr Ala Thr Arg Gln Phe Ser Arg Thr Cys Arg Ala Arg Leu
210 215 220

Asn Cys Phe Pro Met Thr Met Asn Gly Ile Pro Val Gly Gln Trp Leu
225 230 235 240

Ala Pro Gly Gly Glu Leu Val Ala Gly Pro Leu Phe Arg Glu Glu Gly
245 250 255

Ile Leu Val Cys Glu Leu Asp Pro Ala Lys Ser Ala His Ala Lys Arg
260 265 270

Ser Phe Asp Val Ala Gly His Tyr Ala Arg Pro Asp Ile Phe Glu Leu
275 280 285

Glu Ile Asp Arg Asp Pro Gln Asp Pro Val Glu Trp Asp
290 295 300



<210> 353 
<211> 1035 
<212> DNA 

<213> Pseudomonas putida ATCC 700801



<400> 353
atgaatgctt caaaaacaca ttataaagtt gcagctgtgc aagccgctcc tgaattcctg 60
gatttggaca aaggggttga taaggctata cgcctcatta aagaagcagc tgataacgga 120
gcatcgctgg ttgcctttcc ggaagtttat cttccagggt atccgtggtg gatctggctt 180
ggctctcccg catggggaat gcagtttgtt cagcgctacg tagaaaactc gcttgattta 240
aaaagcgagc agttcgagcg actgtgcaaa gcggctgcta cttaccgtat tcacgttgta 300
atgggttaca gcgaacgctc gtttggcacc ctctacctcg gtcaagcaat tatcgatgac 360
agcgggaaag taattggtac gcgccgtaag ctcaagccca cccatgctga gcgtactgtt 420
tacggtgagg gtaacggcag tgacctcagg gttttcaatt cgcaacttgg aagggtaggt 480
gcactctgct gcgcagagca cgtacaacca ctctcgaagt ttgcgatgta cagccagcat 540
gagcagttgc acattgcctc ttggccgagc ttctcggtat atcgcggtgg cgcatatcaa 600
ctgagcgctg aggctaactg cgcggccacc caagtctatg ctcttgaagg ccagtgcttc 660
gtaatttcag catgcgcaat cgtatctaaa gacatgctaa acgttctcat cgacacccct 720
gacaagggta acctgttgca ggatggcggt ggcttcgcga tgatttatgg ccccgatggc 780
gcaccgctgt gtgaacccct gggcgaacat gaagaaggca tcctttatgc cgacgtcgat 840
ttgggcgcca tttccgtagc taaagcggca ctcgacccgg ttggccatta ctcgcggcca 900
gatgttttgc gtctgctttt caacgatcaa ccgacacctt gcgtggaggc gttcaatccg 960
gctcctgtcg gtactgatgc tccaggtaca gaccttcagg gtgatgagcc cgatgcgcaa 1020
ccgatatctg aataa 1035



<210> 354 
<211> 312 
<212> PRT 

<213> Pseudomonas putida ATCC 700801



<400> 354 

Met Asn Ala Ser Lys Thr His Tyr Lys Val Ala Ala Val Gln Ala Ala
1 5 10 15

Pro Glu Phe Leu Asp Leu Asp Lys Gly Val Asp Lys Ala Ile Arg Leu
20 25 30

Ile Lys Glu Ala Ala Asp Asn Gly Ala Ser Leu Val Ala Phe Pro Glu
35 40 45

Val Tyr Ser Arg Val Ser Val Val Asp Leu Ala Trp Leu Ser Arg Met
50 55 60

Gly Asn Ala Val Cys Ser Ala Leu Arg Arg Lys Arg Leu Ile Lys Ala
65 70 75 80

Ser Ser Ser Ser Asp Cys Ala Lys Arg Leu Leu Leu Thr Val Phe Thr
85 90 95
Leu


Trp 


Leu 


Gln


Arg 


Thr 


Leu 


Val 


Trp 


His 


Pro 


Leu 


Pro 


Arg 


Ser 


Ser 






100 










105 










110 




rilS 


Asn 


Tyr 


Arg 


Gln


Arg 


Glu 


Ser 


Asn 


Trp 


Tyr 


Ala 


Pro 


Ala 


Gln


Ala




115 










120 










125








Pro 


Cys 


Ala 


Tyr 


Cys 


Leu 


Arg 


Gly 


Arg 


Gln


Pro 


Gln


Gly 


Phe 




fne 




130 










135 










140








inr 


Ala 


Thr 


Trp 


Lys 


Gly 


Arg 


Cys 


Thr 


Leu 


Leu 


Arg 


Arg 


Ala 


Arg 


Thr 


145 






150 










155 










160


Thr 


Leu 


Glu 


Val 


Cys 
165 


Asp 


Val 


Gln


Pro 


Ala 
170 


Ala 


Val 


Ala 


Hjls 


Cys 

175


Leu 


Leu 


Ala 


Glu 


Leu 


Leu 


Gly 


Ile


Ser 


Arg 


Trp 


Arg 


Ile


Ser 


rp \_ ~- 

inr 


ta-LU 


Arg 








180 








185 










190






Gly 


Leu 


Arg 


Gly 


His 


Pro 


Ser 


Leu 


Cys 


Ser 


Arg 


Pro 


Val 


Leu 


Arg 


Asn 




195 










200 










205








Phe 


Ser 


Met 


Arg 


Asn 


Val 


Ser 


Lys 


Asp 


Met 


Leu 


Asn 


Val 


Leu 


Ile


Asp 




210 








215 










220 










Thr 


Pro 


Asp 


Lys 


Gly 


Asn 


Leu 


Leu 


Gln


Asp 


Gly 


Gly 


Gly 


Phe 


Ala 


Men 


225 






230 










235 










240


Ile


Tyr 


Gly 


Pro 


Asp 


Gly 


Ala 


Pro 


Leu 


Cys 


Glu 


Pro 


Leu 


fZl xr 

uiy 


OJ_ U. 


His 






245 










250










255 




Glu 


Glu 


Gly 


Ile


Leu 


Tyr 


Arg 


Arg 


Arg 


rne 


Gly Arg 


U-i c> 

ni o 


Phe 


Arg 


Ser 






260 










265 










270 






Ser 


Gly 


Thr 


Arg 


Pro 


Gly 


Trp 


Pro 


Leu 


Leu 


Ala 


Ala 


Arg 


Cys 


Leu 


Arg 




275 










280 










285 








Leu 


Leu 


Phe 


Asn 


Asp 


Gln


Pro 


Thr 


Pro 


Cys 


Val 


Glu 


Ala 


Phe 


Asn 


Pro 




290 








295 










300 










Ala 


Pro 


Val 


Gly 


Thr 


Asp 


Ala 


Pro 


















305 










310 























<210> 355 
<211> 1014 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 355 

atggccgtct ctaaagacgg tactgtttca ggaaagtcgc ctatccgatt gcatgtcgcc 60
gcgatacaga tggtcccaaa gctgggtgac gcgcaggcga acgtgaatca ggcagaagcc 120
cttattcgga aggctcttgg gctgggtgcg cgttggatcg tgttaccaga gatgtttacc 180
tccggtgcgg cgtttcatcc cgacatgctc aaagccattc agccattcga tggcgcccca 240
ctccagttgc tgaaagacct ttctcgcaag ggcaatgctg tcatcggcgg ctcgtttctc 300
gccaagcgtg ggcaacaagt attcaatacc ttcgttttgg tttctccgga cgggtcagtc 360
gtaacgcatg acaaggattc accgacctat tgggaaaatt gctattaccg gggcggtacc 420
gatgatggcg tgttgtctac gcccattggc ccggtcggct ccgtcctctg ttgggaattt 480
atccgctcaa gaaccgcgag acggctggcg aacaaggtca agatggtcgt gggaggctcc 540
tgttggtgga cgctccccga tgatgctgat ccagacagcc cgcgcagagc cgtgaacctc 600
aagatgctgc aagaagcgcc ggttcgcatg gcgcggatgc tgggtgttcc ggtaatacat 660
ggctcccacg cgggcagctt cgaaggattc ttcagtccgg aacttgcgga tgttccctat 720
aactcgacgt acctgggcga gacaatgatt gtcgacgcgg gtggccgggt acttgcccgt 780
agagcgcaag atgcaggcga aggcgtggta acggcagaag tggttttgcc cgacaagtcc 840
gtaccaagcg aacccatccc ggagactttc tggattccca aggaaatgcc ggatgattgg 900
aaagaagcct gggagcgttg gttcgatacc ggtgcggatt actacgagat ggtgaccgcg 960
ccctttatca agacgggtgt gataaacgag tacacaccgg aatatcttag gtag 1014



<210> 356 
<211> 325 
<212> PRT 
<213> Unknown 
<220>

<223> Obtained from an environmental sample 



<400> 356

Met Ala Val Ser Lys Asp Gly Thr Val Ser Gly Lys Ser Pro Ile Arg
1 5 10 15

Leu His Val Ala Ala Ile Gln Met Val Gln Ser Trp Val Thr Arg Arg
20 25 30

Arg Thr Ile Arg Gln Lys Pro Leu Phe Gly Arg Leu Leu Gly Trp Val
35 40 45

Arg Val Gly Ser Cys Tyr Gln Arg Cys Leu Pro Pro Val Arg Arg Phe
50 55 60

Ile Pro Thr Cys Ser Lys Pro Phe Ser Phe Asp Gly Ala Pro Leu Gln
65 70 75 80

Leu Leu Lys Asp Leu Ser Arg Lys Gly Asn Ala Val Ile Gly Gly Ser
85 90 95

Phe Leu Gln Ala Trp Ala Thr Ser Ile Gln Tyr Leu Arg Phe Gly Phe
100 105 110

Ser Gly Arg Val Ser Arg Asn Ala Gln Gly Ser Pro Thr Tyr Trp Glu
115 120 125

Asn Cys Tyr Tyr Arg Gly Gly Thr Asp Asp Gly Val Leu Ser Thr Pro
130 135 140

Ile Gly Pro Ser Ala Pro Ser Ser Val Gly Asn Leu Ser Ala Gln Glu
145 150 155 160

Pro Arg Asp Gly Trp Arg Thr Arg Ser Arg Trp Val Gly Gly Ser Cys
165 170 175

Trp Trp Thr Leu Pro Asp Asp Ala Asp Pro Asp Ser Pro Arg Arg Ala
180 185 190

Val Asn Leu Arg Cys Cys Lys Lys Arg Arg Phe Ala Trp Arg Gly Cys
195 200 205

Trp Val Phe Arg Tyr Met Ala Pro Thr Arg Gln Leu Arg Arg Ile Leu
210 215 220

Gln Ser Gly Thr Cys Gly Cys Ser Leu Leu Asp Val Pro Gly Arg Asp
225 230 235 240

Asn Asp Val Asp Ala Gly Gly Arg Val Leu Ala Arg Arg Ala Gln Asp
245 250 255

Ala Gly Glu Gly Val Val Thr Ala Glu Gly Phe Ala Arg Gln Val Arg
260 265 270

Thr Lys Arg Thr His Pro Gly Asp Phe Leu Asp Ser Gln Gly Asn Ala
275 280 285

Gly Ile Gly Lys Lys Pro Gly Ser Val Gly Ser Ile Pro Val Arg Ile
290 295 300

Thr Thr Arg Trp Pro Arg Pro Leu Ser Arg Arg Val Thr Ser Thr His
305 310 315 320

Arg Asn Ile Leu Gly
325

<210> 357 
<211> 951 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 357 

atgacaaaac tagccatcgt acaaaaaccg ccagtctttc tggataagca aaaaaccatt 60
gagctggccg tcgccaacat tgaagaggcc gccgccaagg gtgccgatct cgtggtgttt 120
tctgaagctt tcattcccgg ctatcctgcc tggatctggc gtctacgccc cggcggtgac 180
tgggggcttt cagaagagtt gcaccagcgt ttgctgcgca atgccgtcaa tgtggactcc 240
gatgatctgg ctccgttgtt tgaggtcgcc cgcaagcacg aactcaccat cgtttgcggt 300
atcgaggagc gtgacaacaa actaagtcaa acaaccttat ataacaccgt catcacgatt 360
ggtcccgatg gatcgttact gaacaaacat cgcaagctta tgcccaccaa cccggagcga 420
atggtgtggg ggtttggtga cgcatccggt ttaaaagtcg ttgataccaa tgctggtcga 480
attggctcat taatgtgctg ggaaaattac atgccgctgg ctcgctatgc cctatatgca 540
caaggtgtcg agatctatat cgcaccgacc tacgacagcg gtgatggctg gataggcagc 600
atgcagcaca tcgcacgtga agggggctgt tgggtggtgg gatgtgggtg tctcatgaaa 660
ggcagtgata ttccagatga tttcccggag aaatccacgt tgtatccaga tgcagatgaa 720
tgggtgaacc cgggtgattc tgtagtgata gcacccggcg gtgaaattat ggccggccca 780
atgaacagag agtccggtat tttgtatcac gagctagaca gagaaaaagt cagcaacgct 840
aaacgagcat tcgatgttgc cgggcattat tcacgtcccg atatctttca gctgcatgta 900
aatacacagg agcagtcacc ctgcgtattc gaaaataatt ccataactta 951


<210> 358 
<211> 300 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 358 

Met Thr Lys Leu Ala Ile Val Gln Lys Pro Pro Val Phe Leu Asp Lys
1 5 10 15

Gln Lys Thr Ile Glu Leu Ala Val Pro Thr Leu Lys Arg Pro Pro Pro
20 25 30

Arg Val Pro Ile Ser Trp Cys Phe Leu Lys Leu Ser Phe Pro Ala Ile
35 40 45

Leu Trp Ile Trp Arg Leu Arg Pro Gly Gly Asp Trp Gly Leu Ser Glu
50 55 60

Glu Leu His Gln Arg Leu Leu Arg Asn Ala Val Asn Val Asp Ser Asp
65 70 75 80

Asp Leu Ala Pro Leu Phe Glu Val Ala Arg Lys His Glu Leu Thr Ile
85 90 95

Val Cys Gly Arg Gly Ala Gln Gln Thr Lys Ser Asn Asn Leu Ile His
100 105 110

Arg His His Asp Trp Ser Arg Trp Ile Tyr Thr Asn Ile Ala Ser Leu
115 120 125

Cys Pro Pro Thr Arg Ser Glu Trp Cys Gly Gly Leu Val Thr His Pro
130 135 140

Leu Lys Val Val Asp Thr Asn Ala Gly Arg Ile Gly Ser Leu Met Cys
145 150 155 160

Trp Glu Asn Tyr Met Pro Leu Ala Arg Tyr Ala Leu Tyr Ala Gln Gly
165 170 175

Val Glu Ile Tyr Ile Ala Pro Thr Tyr Asp Ser Gly Asp Gly Trp Ile
180 185 190

Gly Ser Ala Ala His Arg Thr Arg Gly Leu Leu Gly Gly Gly Met Trp
195 200 205

Val Ser His Glu Arg Gln Tyr Ser Met Ile Ser Arg Arg Asn Pro Arg
210 215 220

Cys Ile Gln Met Gln Met Asn Gly Thr Arg Val Ile Leu Thr Arg Arg
225 230 235 240

Asn Tyr Gly Arg Pro Asn Glu Gln Arg Val Arg Tyr Phe Val Ser Arg
245 250 255

Ala Arg Gln Arg Lys Val Ser Asn Ala Lys Arg Ala Phe Asp Val Ala
260 265 270

Gly His Tyr Ser Arg Pro Asp Ile Phe Gln Leu His Val Ile His Arg
275 280 285

Ser Ser His Pro Ala Tyr Ser Lys Ile Ile Pro Leu
290 295 300



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
951 



<210> 359 
<211> 1029
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 359 

atggcaaacg tcgttcgtgc tgcggccgta cagttgagcc ctgttttggg tagtcgcgag 60
ggtacggtag agaaggtagt tgctgcgatc cgtgacgccg cctcgcaggg cgcacagctg 120
tgcgttttcc cggagacggt tgttccctat tatccgtatt tctcgttcat tcggccgccc 180
gcggccatgg gcaaagacca catgcagctg tacgagcaag ctgtggtcgt gccttctccc 240
agcacgaacg cgattgccgc ggcggccaaa caacactcga tcgtcgtttc aatcggcgtc 300
aatgaacgcg atcacggtac gatatacaac acgcagttgt tgttcgatgc cgacgggaca 360
ctcgtgcaac ggcgtcgcaa gataaccccc acgttccacg agcgtatggt gtggggtcaa 420
ggtgatgggt cgggtttgcg ctgtgtcgac acacaaatcg ggcgcatcgg tagcctggct 480
tgttgggaac attacaatcc cttggcgcgc tacgcattga tggccgatca cgaagagatc 540
cacgtcgcca tgtttccggg ttcgatggtg ggtcagatct tcgccgatca aattcaggta 600
accattcgcc accacgcgct cgaaagcggc tgtttcgtcg tcaacgctac ggggtatctg 660
agcaaggaac aggtcgccca gttgtcacaa ggcacgtcgc tcgacgcggc gttgaccggt 720
ggttgttaca ccgcgattgt atcgcctgaa ggcgtcgtac tgggcgaacc gctcaccgac 780
ggcgaaggca tggtcgtggc ggatatggat ctcagcctca tcaccaaacg caaacgcatg 840
atggatagcg tcgggcacta cagtcgcccg gaattgctgt ctctgctgat caatcgaacg 900
ccaacccaca cggcggtcga cgtcgaattc aactccaatt ccgagtctca tcatgtcagc 960
aatacacgaa caccaaagcg cacaactggc ccacgttcga accttcaagt tgccgctgat 1020
cgcgagtaa 1029



<210> 360 
<211> 335 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 360 
Met Ala Asn Val Val Arg Ala Ala Ala Val Gln Leu Ser Pro Val Leu
1 5 10 15

Gly Ser Arg Glu Gly Thr Val Glu Lys Val Val Ala Ala Ile Arg Asp
20 25 30

Ala Ala Ser Gln Gly Ala Gln Leu Cys Val Phe Pro Glu Thr Val Val
35 40 45

Pro Tyr Ser Val Phe Leu Val His Ser Ala Ala Arg Gly His Gly Gln
50 55 60

Arg Pro His Ala Ala Val Arg Ala Ser Cys Gly Arg Ala Phe Ser Gln
65 70 75 80

His Glu Arg Asp Cys Arg Gly Gly Gln Thr Thr Leu Asp Arg Arg Phe
85 90 95

Asn Arg Arg Met Asn Ala Ile Thr Val Arg Tyr Thr Thr Arg Ser Cys
100 105 110

Cys Ser Met Pro Thr Gly His Ser Cys Asn Gly Arg Lys Ile Thr Pro
115 120 125

Thr Phe His Glu Arg Met Val Trp Gly Gln Gly Asp Gly Ser Gly Leu
130 135 140

Arg Cys Val Asp Thr Gln Ile Gly Arg Ile Gly Ser Leu Ala Cys Trp
145 150 155 160

Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Asp His Glu
165 170 175

Glu Ile His Val Ala Met Phe Pro Gly Ser Met Val Gly Gln Ile Phe
180 185 190

Ala Asp Gln Ile Gln Val Pro Phe Ala Thr Thr Arg Ser Lys Ala Ala
195 200 205








Val 


Ser 


Ser 


Ser 


Thr 


Leu 


Arg 


Gly 


Ile


Ala 


Arg 


Asn 


Arg 


Ala 


pi n 

lai.n 


Leu 




210 










215 










220








Thr


Ser 


Gln


Gly 


Thr 


Ser 


Leu 


Asp 


Ala 


Ala 


Leu 


Thr 


Gly 


Gly 


Cys 


Tyr 


225 








230 










235










240 


Ala 


Ile


Val 


Ser 


Pro 


Glu 


Gly 


Val 


Val 


Leu 


Gly 


Glu 


Pro 


Leu 


Thr 


Asp 






245 








250 










255




Gly 


Glu 


Gly 


Met 


Val 


Val 


Ala 


Asp 


Met 


Asp 


Leu 


Ser 


Leu 


Ile


Pro 




260 










265 










270






Ala 


Asn 


Ala 


Trp 


Ile


Ala 


Ser 


Gly 


Thr 


Thr 


Val 


Ala 


Arg 


Asn 


Cys 


Cys 






275 








280 










285 




Phe 




Leu 


Cys 


Ser 


Ile


Thr 


Pro 


Thr 


His 


Thr 


Ala 


Val 


Asp 


Val 


Glu 


Asn 




290 










295 










300 








Ala 


Ser 


Asn 


Ser 


Glu 


Ser 


His 


His 


Val 


Ser 


Asn 


Thr 


Arg 


His 


Gln


Ser 


305 








310 










315 






Ala 


Ser 


320 


Gln


Leu 


Ala 


His 


Val 


Arg 


Thr 


Phe 


Lys 


Leu 


Pro 


Leu 


Ile










325 








330 










335 





<210> 361 
<211> 951 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 361 

atgctaaacg agaacaatgg cgctactttc aaggttgctg ccgtgcaggc ttcaccggta 
tttcttgatc gcgctgctac aatcgacaag gcttgcgatt taattgctac ggccggacgc 
gagggggctc gcctgatcgt ctttccagaa gcgttcgtcc cggcctatcc tgattgggta 
tgggcgattc ccgcgggtga tgagggcatg ctcaatgagc tgtatgcaga attacttgcc 
aatgctgtca ccattcccag cgatgcgacc gagaggttgt gtcgcgcggc gcggcttgct 
aatgcttacg tggtgatggg gatgagcgaa cgcaatgccg aagcgagtgg cgccagcctg 
tataatacgc tgttgtatat tgatgcacag ggacaaatcc tgggaaagca ccggaagctg 
gttccaacgg gcggcgagcg cctggtatgg gcacagggag atggcagcac cctggaggtt 
tacgatactc ctttgggaaa actcggtggc ttaatctgct gggagaatta tatgccgctg 
gcacgctata ctatgtatgc ctggggcacg caaatctaca ttgcagcgac gtgggatcgc 
gggcagccat ggctatccac tttgcgacac attgctaaag agggcagagt atatgtgatc 
ggctgttgta ttgctatgcg caaagatgat atccccgacc attacgcgat gaaggagaag 
tattacgcgg aagaagacga gtggatcaat attggcgata gcgcaatcgt caatccagaa 
ggggtattta ttgccgggcc agtgcgtaag caagaagaaa tcctctacgc cgaggttgac 
ccgcgaatga tgcaggggcc aaagtggatg ctcgacgtgg caggacatta cgcgcgcccg 
gatgtattcc agttgacggt gcacacggag aggcggcaga tgatccacta g 

<210> 362 

<211> 302 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 362
Met Leu Asn Glu Asn Asn Gly Ala Thr Phe Lys Val Ala Ala Val Gln
1 5 10 15

Ala Ser Pro Val Phe Leu Asp Arg Ala Tyr Asn Arg Gln Gly Leu Arg
20 25 30

Phe Asn Cys Tyr Gly Arg Thr Arg Gly Gly Ser Pro Asp Arg Leu Ser
35 40 45

Arg Ser Val Arg Pro Gly Leu Ser Leu Gly Met Gly Asp Ser Arg Gly
50 55 60

Gly His Ala Gln Ala Val Cys Glu Leu Leu Ala Asn Ala Val Thr Ile
65 70 75 80

Pro Ser Asp Ala Thr Glu Arg Leu Cys Arg Ala Ala Arg Leu Ala Asn
85 90 95

Ala Tyr Val Val Met Gly Met Ser Glu Arg Asn Ala Glu Ala Ser Gly
100 105 110

Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Cys Thr Gly Thr Asn Pro Gly
115 120 125

Lys Ala Pro Glu Ala Gly Ser Asn Gly Arg Arg Ala Pro Gly Met Gly
130 135 140

Thr Gly Arg Trp Gln His Pro Gly Gly Leu Arg Tyr Ser Phe Gly Lys
145 150 155 160

Thr Arg Trp Leu Asn Leu Leu Arg Ile Ile Cys Arg Trp His Ala Ile
165 170 175

Leu Cys Met Pro Gly Ala Arg Lys Ser Thr Leu Gln Arg Arg Gly Ile
180 185 190

Ala Gly Ser His Gly Tyr Pro Leu Cys Asp Thr Leu Leu Lys Arg Ala
195 200 205

Glu Tyr Met Ser Ala Val Val Leu Tyr Ala Gln Arg Tyr Pro Arg Pro
210 215 220

Leu Arg Asp Glu Gly Glu Val Leu Arg Gly Arg Arg Arg Val Asp Gln
225 230 235 240

Tyr Trp Arg Arg Asn Arg Gln Ser Arg Arg Gly Ile Tyr Cys Arg Ala
245 250 255

Ser Ala Ala Arg Arg Ile Leu Tyr Ala Glu Val Asp Pro Arg Met Met
260 265 270

Gln Gly Pro Lys Trp Met Leu Asp Val Ala Gly His Tyr Arg Ala Arg
275 280 285

Met Tyr Ser Ser Arg Cys Thr Arg Arg Gly Gly Arg Ser Thr
290 295 300



<210> 363 
<211> 1053 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 363 

atgccgaaaa agtcgaccgt ccgggtcgca gccgtccaga ttgcgccgga tctgacatcg 60
cgggaaaaga cggtggcacg cgtgatcgag gcgatcgccc aggcatccgc caaaggtgcg 120
gagcttgtgg tttttcccga gacctttgtg ccgtggtatc cttatttctc gttcgtgttg 180
cccccggtct tgtcgggcaa ggagcacctg cggctctacg aagaggcggt tgcggtgcca 240
agtgccgcca caagaagcgt agcggctgcc gctcgcgaac atggcatcgt cgtggcgctt 300
ggcgtcaacg agcgcgacta tggcacgctc tacaatacgc aactgctttt cgatgccgat 360
ggcagtctga tcctgaagcg gcgcaagatc accccgactt tccacgagcg gatgatctgg 420
ggccagggcg atgcctcagg cctgaaggtt gtcgacagcg ccattggccg catcggcgcg 480
ctggcctgct gggaacacta caatccgcta gcccgctatg cgctgatggc gcagcacgag 540
gaaatccaca ttgcgcagtt tcccggctcc atggtcgggc cgatctttgc cgatcagatg 600
gaggtgacga tccgccatca cgcgctggaa agcggctgct tcgtcgtcaa tgccacggga 660
tggctgacgg atgatcagat cgtctcgatc acaccggata ccggcctgca aaaagcgctg 720
cggggtggct gcatgacggc gatcatttcc cccgaaggca agcatctcgt gccgccgctc 780
accgaaggtg agggtatcct cgtcgccgat ctcgacatga gcctcattct caagcgcaag 840
cgcatgatgg attcggtcgg ccactatgcc cggcccgagt tgctgcacct cgtcatggac 900
gcccggccgg ctgcgccgat gagggaatcg tccatgccca ctgccttccc cggcgaaaca 960
ttgacaaccg acatgaccga tggagaacag gatgcgtctt tcgacggaaa cgctgatcaa 1020
cgaattgcag tccttcggag cccggctggt tga 1053

<210> 364
<211> 335
<212> PRT
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 364 



Met Pro Lys Lys Ser Thr Val Arg Val Ala Ala Val Gln Ile Ala Pro
1 5 10 15

Asp Leu Thr Ser Arg Glu Lys Gly Gly Thr Arg Asp Arg Gly Asp Arg
20 25 30

Pro Gly Ile Arg Gln Arg Cys Gly Ala Cys Gly Phe Ser Arg Asp Leu
35 40 45

Cys Arg Gly Ile Leu Ile Ser Arg Ser Cys Cys Pro Arg Ser Cys Arg
50 55 60

Ala Arg Ser Thr Cys Gly Ser Thr Lys Arg Arg Leu Arg Cys Gln Val
65 70 75 80

Pro Pro Gln Glu Ala Arg Leu Pro Leu Ala Asn Met Ala Ser Ser Trp
85 90 95

Arg Leu Ala Ser Thr Ser Ala Thr Met Ala Arg Ser Thr Ile Arg Asn
100 105 110

Cys Phe Ser Met Pro Met Ala Val Ser Ser Gly Ala Arg Ser Pro Arg
115 120 125

Leu Ser Thr Ser Gly Ser Gly Ala Arg Ala Met Pro Gln Ala Glu Gly
130 135 140

Cys Arg Gln Arg His Trp Pro His Arg Arg Ala Gly Leu Leu Gly Thr
145 150 155 160

Leu Gln Ser Ala Ser Pro Tyr Ala Leu Met Ala Gln His Glu Glu
165 170 175

His Ile Ala Gln Phe Pro Gly Ser Met Val Gly Pro Ile Phe Ala Asp
180 185 190

Gln Met Glu Val Thr Ile Arg His His Ala Leu Glu Ser Gly Cys Phe
195 200 205

Val Val Asn Ala Thr Gly Trp Arg Met Ile Arg Ser Ser Arg Ser His
210 215 220

Arg Ile Pro Ala Cys Lys Lys Arg Cys Gly Val Ala Ala Gly Asp His
225 230 235 240

Phe Pro Arg Arg Gln Ala Ser Arg Ala Ala Ala His Arg Arg Gly Tyr
245 250 255

Pro Arg Arg Arg Ser Thr Ala Ser Phe Ser Ser Ala Ser Ala Trp Ile
260 265 270

Arg Ser Ala Thr Met Pro Gly Pro Ser Cys Cys Thr Ser Ser Trp Thr
275 280 285

Pro Gly Arg Leu Arg Arg Gly Asn Arg Pro Cys Pro Leu Pro Ser Pro
290 295 300

Ala Lys Ile Asp Asn Arg His Asp Arg Trp Arg Thr Gly Cys Val Phe
305 310 315 320

Arg Arg Lys Arg Ser Thr Asn Cys Ser Leu Arg Ser Pro Ala Gly
325 330 335



<210> 365 
<211> 975 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample
<400> 365 

atgacaaagc tagccatcgt tcaaaaaccc cccgtctttc tggataaaga aaaaaccata 60 
gcgaagacgg ttgattccat aaaagaggcc gcgacacaaa atgccgactt ggtcatcttc 120
accgaagcct tcatcccggg ctaccccacc tggatatggc gacttaggcc aggcgctgat 180

tggggcctct cagaagagct gcacgagcag ttattgcgta acgcggtgag tatgggatcg 240

accgacctgg atccgcttta tgaagccgcc caacagcata acgtcactat tgtttgcggc 300 

atcgtagaaa gagaccacca actcagccaa tcaaccctct acaacagcat ggtcgtcatt 360 

gacacagacg gaacccttct caacaagcac cgcaaactca tgccaactaa tcccgaacgc 420 

atggtgtggg gctttggcga cgcctcagga ctcaaagccg ttgcaacacc tgcaggccgc 480

atcagcacgt tgttgtgttg ggagaactac atgccattgg cccgatatgc tctgtatgca 540 

caaggcgtgg aaatctatat cgcgccaact tacgacagtg gtgcgggttg gataggaagc 600 

ttgcaacaca tagcacgcga aggtcgatgc tgggtcgtgg gctgtggcaa cctgattcag 660 

gccagtgatc tgcctgaaga cttcccggac aaggacaacc tctacccgga cgcagaagag 720 

tgggtgaacc cgggtgactc catagtcatt gcaccagacg gtgagattgt ggccggtcca 780

atgcacaaag agacaggaat tttgtactgc gagatagatc tggagaaagt cagaattgca 840 

aaacgagcat tagacgtgac cgggcattat tcgcgaccgg acgttttcaa actgcatgtg 900 

aatacccgac ctcaatcacc tgtggaattt gaaggtcagg agaccaacaa tccaacaaca 960 

ggagaaagct catga 975 



<210> 366 
<211> 315 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 366 



Met Thr Lys Leu Ala Ile Val Gln Lys Pro Pro Val Phe Leu Asp Lys
1 5 10 15

Glu Lys Thr Ile Ala Lys Thr Val Ile Pro Lys Arg Pro Arg His Lys
20 25 30

Met Pro Thr Trp Ser Ser Ser Pro Lys Pro Ser Ser Arg Ala Thr Thr
35 40 45

Trp Ile Trp Arg Leu Arg Pro Gly Ala Asp Trp Gly Leu Ser Glu Glu
50 55 60

Leu His Glu Gln Leu Leu Arg Arg Gly Glu Tyr Gly Ile Asp Arg Pro
65 70 75 80

Gly Ser Ala Leu Ser Arg Pro Thr Ala Arg His Tyr Cys Leu Arg His
85 90 95

Arg Arg Lys Arg Pro Pro Thr Gln Pro Ile Asn Pro Leu Gln Gln His
100 105 110

Gly Arg His His Arg Arg Asn Pro Ser Gln Gln Ala Pro Gln Thr His
115 120 125

Ala Asn Ser Arg Thr His Gly Val Gly Leu Trp Arg Ala Ser Gly Leu
130 135 140

Lys Ala Val Ala Thr Pro Ala Gly Arg Ile Ser Thr Leu Leu Cys Trp
145 150 155 160

Glu Asn Tyr Met Ile Gly Pro Ile Cys Ser Val Cys Thr Arg Arg Gly
165 170 175

Asn Leu Tyr Arg Ala Asn Leu Arg Gln Trp Cys Gly Leu Asp Arg Lys
180 185 190

Leu Ala Thr His Ser Thr Arg Arg Ser Met Leu Gly Arg Gly Leu Trp
195 200 205

Gln Pro Asp Ser Gly Ser Asp Leu Pro Glu Asp Phe Pro Asp Lys Asp
210 215 220

Asn Leu Tyr Pro Asp Ala Glu Glu Trp Val Asn Pro Gly Asp Ser Ile
225 230 235 240

Val Ile Ala Pro Asp Gly Glu Ile Val Ala Gly Pro Met His Lys Glu
245 250 255

Thr Gly Ile Leu Tyr Arg Asp Arg Ser Gly Glu Ser Gln Asn Cys Lys
260 265 270

Thr Ser Ile Arg Arg Asp Arg Ala Leu Phe Ala Thr Gly Arg Phe Gln
275 280 285

Thr Ala Cys Glu Tyr Pro Thr Ser Ile Thr Cys Gly Ile Arg Ser Gly
290 295 300

Asp Gln Gln Ser Asn Asn Arg Arg Lys Leu Met
305 310 315



<210> 367 
<211> 981 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 367 

atgactgccc actttgccga taccctgacc gttgccgtgg cccagatcgc gccggtgtgg 60
ctacagcgcg aagctaccct gcaaaagatg ctggtctggg tggagcgcgc cgcgcgcgaa 120
ggcgcgggcc tggtcgcctt cagtgagggc ctgctgcctg gctacccctt ctggattgaa 180
cacacggacg gcgcccgctt cgagtcgccg ttacagaagc gcctgtacgc ccactactgc 240
gatcagtcgg tccagatcaa tgctgggcac ctcgcaccac tctgcgctgc cgcggccaga 300
caccagattt gggtggtctg cggcgtcatc gagcgcgaca gcgcacgcgg actcagcgtg 360
tttgcatcaa tggtcaccat cgatgccgaa ggcgcgatcc gcagtgtgca ccgcaagctg 420
atgccgacct acgaagaacg cctggtgtgg tcgcccggcg acgcgcacgg actgcgctgc 480
catccgctcg gccagttccg cctcggcagc ctcaattgct gggagaactg gatgccgctg 540
gcgcgcgccg ccctgtacgc ccagggcgag tctttgcatg ttgcatcctg gcccggcagt 600
cgccgcaaca ccgagaccat tactcccttc atcgcccgcg aaggccgcag ttacgcgctg 660
tccgccagtt ccgtgctgca ccgggatgat ctgcccgact ccgttccggc gctgtcggtg 720
ctgcgcgact gcctgccgga cgtgatggcc gacggcggct cctgcgtcgc cggccccgac 780
ggacatttcc tgatcgagcc ggtcgtcggc cgggaagagc tgctgctcgc gcagatcgat 840
catgcccggg tacgcgagga acgtcagaac ttcgacccct tcggccacta ctcgcggccg 900
gaactcctgt cgctggtggt ggatacgcgc cgggcgagcg gagtgcagat agtgaatgct 960
gaccatggct ttaagccctg 981



<210> 368 
<211> 317 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 368 



Met Thr Ala His Phe Ala Asp Thr Leu Thr Val Ala Val Ala Gln Ile
1 5 10 15

Ala Pro Val Trp Leu Gln Arg Glu Tyr Pro Ala Lys Asp Ala Gly Leu
20 25 30

Gly Gly Ala Arg Arg Ala Arg Arg Arg Gly Pro Gly Arg Leu Gln Gly
35 40 45

Pro Ala Ala Trp Leu Pro Leu Leu Asp Thr His Gly Arg Arg Pro Leu
50 55 60

Arg Val Ala Val Thr Glu Ala Cys Thr Pro Thr Thr Ala Ile Ser Arg
65 70 75 80

Ser Arg Ser Met Leu Gly Thr Ser His His Ser Ala Leu Pro Ala Arg
85 90 95

His Gln Ile Trp Val Val Cys Gly Val Ile Glu Arg Asp Ser Ala Arg
100 105 110

Gly Leu Ser Val Phe Ala Gln Trp Ser Pro Ser Met Pro Lys Ala Arg
115 120 125

Ser Ala Val Cys Thr Ala Ser Cys Arg Pro Thr Lys Asn Ala Trp Cys
130 135 140

Gly Arg Pro Ala Thr Arg Thr Asp Cys Ala Ala Ile Arg Ser Ala Ser
145 150 155 160

Ser Ala Ser Ala Ala Gln Leu Leu Gly Glu Leu Asp Ala Ala Gly Ala
165 170 175

Arg Arg Pro Val Arg Pro Gly Arg Val Phe Ala Cys Cys Ile Leu Ala
180 185 190

Arg Gln Ser Pro Gln His Arg Asp His Tyr Ser Leu His Arg Pro Arg
195 200 205

Arg Pro Gln Leu Arg Ala Ser Ala Ser Ser Val Leu His Arg Asp Asp
210 215 220

Leu Pro Asp Ser Val Pro Ala Leu Ser Val Leu Arg Asp Cys Leu Arg
225 230 235 240

Thr Trp Pro Thr Ala Ala Pro Ala Ser Pro Ala Pro Thr Asp Ile Ser
245 250 255

Ser Ser Arg Ser Ser Pro Gly Arg Ala Ala Ala Arg Ala Asp Arg Ser
260 265 270

Cys Pro Gly Thr Arg Gly Thr Ser Glu Leu Arg Pro Leu Gly His Tyr
275 280 285

Ser Arg Pro Glu Leu Leu Ser Leu Val Val Asp Thr Arg Arg Ala Ser
290 295 300

Gly Val Gln Ile Val Asn Ala Asp His Gly Phe Lys Pro
305 310 315



<210> 369 
<211> 1074 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 369 

atgtccgaag agaacaccac caccgaatca agctcggacg cgatctggac cgttgcggcc 60
gtccaggcgg cgccggtgtt cttgaatcgg gatgcgacgg tgcgtaaggc cgtctcgctg 120
attgccgagg ccgcggcgca cggcgcgcgt ctcattgttt ttcccgaggc gttcattcca 180
tcgtacccgg attgggcgtg ggcggtcccg cccggacagg gcggaaccaa ctcgagactg 240
tatgccaaac tgctcgacaa ttcggttacg gtgcccagcc cagccaccga tgccctggcc 300
agggcggctc gcgacgcggg cgcctacgtc gtcatgggca taaacgagcg gaacacggcg 360
gcgagcggcg gaagtctcta caacagcttg ctctatattg gtccggacgg tcgcatcctg 420
ggcattcatc ggaagttggt gccgacctcg gcggagcggc tgatctgggc acaaggcgat 480
ggaagcacgc ttggcgtgtt cgatacgccg ggcggccggc ttggcggctt gatctgctgg 540
gaaaactata tgccgctggc ccgatattcg atgtacgcgc gcggcgtgca aatatatgtt 600
gcggcgacct gggacagggg cgagccatgg ctttcgacgc tgcgacacat agccaaggaa 660
ggccagacct acgtgatagg ctgttgcatc gccatgcgga cggccgacat cgacgacgcc 720
gagctcgtcg agaagtatta cgcggacgcc ggtgagtgga tcaatgaagg cgacagcgcg 780
attgtcgatc cgagcggaac aatcattgcc ggtccggccc atcagaccaa tgaaatcctc 840
tacgccgcga tcgatcgcca gaaggtgctg gaatcaaaat ggatgttgga cgtggccggg 900
cactacgcgc gtccggacgt gttttcattt ggcgttcgaa ccgatgccaa cccgataatg 960
accatgaacg aaccaagcgc gacggccgag ccgaggcaca atagcgcggg agccgagggt 1020
cgcgacggcc tacgcgggcg tcgtgaccct cgctcgagaa ttcggcagat gtag 1074



<210> 370 
<211> 346 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 370 

Met Ser Glu Glu Asn Thr Thr Thr Glu Ser Ser Ser Asp Ala Ile Trp
1 5 10 15

Thr Val Ala Ala Val Gln Ala Ala Gly Val Leu Glu Ser Gly Cys Asp
20 25 30

Gly Ala Gly Arg Leu Ala Asp Cys Arg Gly Arg Gly Ala Arg Ala Arg
35 40 45

Leu Ile Val Phe Pro Glu Ala Phe Ile Pro Ser Tyr Pro Asp Trp Ala
50 55 60

Trp Ala Val Pro Pro Gly Gln Ala Glu Pro Thr Arg Asp Cys Met Pro
65 70 75 80

Asn Cys Ser Thr Ile Arg Leu Arg Cys Pro Ala Gln Pro Pro Met Pro
85 90 95

Gly Gln Gly Gly Ser Arg Arg Gly Arg Leu Arg Arg His Gly His Lys
100 105 110

Arg Ala Glu His Gly Gly Ser Gly Gly Ser Leu Tyr Asn Ser Leu Leu
115 120 125

Tyr Ile Gly Pro Asp Gly Arg Ile Leu Gly Ile His Arg Lys Leu Cys
130 135 140

Arg Pro Arg Arg Ser Gly Ser Gly His Lys Ala Met Glu Ala Arg Leu
145 150 155 160

Ala Cys Ser Ile Arg Arg Ala Ala Gly Leu Ala Ala Ser Ala Gly Lys
165 170 175

Thr Ile Cys Arg Trp Pro Asp Ile Arg Cys Thr Arg Ala Val Gln Ile
180 185 190

Tyr Val Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu
195 200 205

Arg His Ile Ala Lys Lys Ala Arg Pro Thr Ala Val Ala Ser Pro Cys
210 215 220

Gly Arg Pro Thr Ser Thr Thr Pro Ser Ser Ser Arg Ser Ile Thr Arg
225 230 235 240

Thr Pro Val Ser Gly Ser Met Lys Ala Thr Ala Arg Leu Ser Ile Arg
245 250 255

Ala Glu Ile Ile Ala Gly Pro Ala His Gln Thr Asn Glu Ile Leu Tyr
260 265 270

Ala Ala Ile Asp Arg Gln Lys Val Leu Glu Lys Met Asp Val Gly Arg
275 280 285

Gly Arg Ala Leu Arg Ala Ser Gly Arg Val Phe Ile Trp Arg Ser Asn
290 295 300

Arg Cys Gln Pro Asp Asn Asp His Glu Arg Thr Lys Arg Asp Gly Arg
305 310 315 320

Ala Glu Ala Gln Arg Gly Ser Arg Val Ala Thr Ala Tyr Ala Gly Val
325 330 335

Val Thr Leu Ala Arg Glu Phe Gly Arg Cys
340 345

<210> 371 
<211> 1014 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 371 

atgggtatcg aacatccgaa gtacaaggtt gcggtggtac aggcggcgcc ggcctggctc 60
gatctcgacg catcgatcgc caagaccatc gcgctgatcg aggaggcggc cgccaagggc 120
gccaagctga tcgcattccc tgaggccttc attcccggct atccttggca tatctggatg 180
gactcgccgg cctgggcgat cgggcgcgga tttgtgcagc gctatttcga caattcgctt 240
tcctacgaca gcccgcaggc cgaacggctg cggctcgcgg tgaagaaggc cggcatcacc 300
gccgtgctcg gcctgtccga gcgggaaggc ggcagccttt atctcgcgca atggctgatc 360
ggtcccgacg gcgagaccat cgccaagcgg cgcaagctgc ggccgacgca tgccgagcgc 420
accgtctacg gcgaaggcga cggcagtgac cttgcggtcc atgaccgcgc tgacattggc 480
cgtctcggcg cgctgtgctg ctgggaacat ctgcagccgc tgtcgaaata cgccatgtat 540
gcccagaacg agcaggtgca tgtcgcggcc tggccgagtt tttcgctgta cgacccgttc 600

gcgccggcgc tgggctggga ggtcaacaac gcggcatccc gcgtctatgc ggtcgaaggc 660

tcctgcttcg tgctggcgcc gtgtgccacc gtctcgcagg cgatggtgga cgaactctgc 720 

gaccgcgacg acaagcatgc gctgctgcat gtcggcggcg gccacgccgc gatctacgga 780 

ccggacggca gctcgatggc gaacaagctc gatcccgagc aggagggcct gctgttcgcc 840 

gacatcgatc tcggggcgat cggggtggca aagaacgccg ccgatccggc cgggcactat 900 

tcgcggccgg atgtgacccg tctgctcttg aacagaaaac cctcaaagcg cgtcgagcac 960 

tttgcgctgc cgctcgacca tctcgcggac gagggcgttg ctccggtgac ctga 1014 



<210> 372 
<211> 327 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 372 



Met Gly Ile Glu His Pro Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15

Pro Ala Trp Leu Asp Leu Asp Ala Arg Ser Pro Arg Pro Ser Arg Ser
20 25 30

Arg Arg Arg Pro Pro Arg Ala Pro Ser Ser His Ser Leu Ala Phe Ile
35 40 45

Pro Gly Tyr Pro Trp His Ile Trp Met Asp Ser Pro Ala Trp Ala Ile
50 55 60

Gly Arg Gly Phe Val Gln Ala Ile Ser Thr Ile Arg Phe Pro Thr Thr
65 70 75 80

Ala Arg Arg Pro Asn Gly Cys Gly Ser Arg Arg Arg Pro His His Arg
85 90 95

Arg Ala Arg Pro Val Arg Ala Gly Arg Arg Gln Pro Leu Ser Arg Ala
100 105 110

Met Ala Asp Arg Ser Thr Ala Arg Pro Ser Pro Ser Gly Ala Ser Cys
115 120 125

Gly Arg Arg Met Pro Ser Ala Pro Ser Thr Ala Lys Ala Thr Ala Val
130 135 140

Thr Leu Arg Ser Met Thr Ala Leu Thr Leu Ala Val Ser Ala Arg Cys
145 150 155 160

Ala Ala Gly Asn Leu Gln Pro Leu Ser Lys Tyr Ala Met Tyr Ala Gln
165 170 175

Asn Glu Gln Val His Val Ala Ala Trp Pro Ser Phe Ala Val Arg Pro
180 185 190

Val Arg Ala Gly Ala Gly Leu Gly Gly Gln Gln Arg Gly Ile Pro Arg
195 200 205

Leu Cys Gly Arg Gly Ser Cys Phe Val Leu Ala Pro Cys Ala Thr Val
210 215 220

Ser Gln Ala Met Val Asp Glu Leu Cys Asp Arg Asp Gln Ala Cys Ala
225 230 235 240

Ala Ala Cys Arg Arg Arg Pro Arg Arg Asp Leu Arg Thr Gly Arg Gln
245 250 255

Leu Asp Gly Thr Ser Ser Ile Pro Ser Arg Arg Ala Cys Cys Ser Pro
260 265 270

Thr Ser Ile Ser Gly Arg Ser Gly Trp Gln Arg Thr Pro Pro Ile Arg
275 280 285

Pro Gly Thr Ile Arg Gly Arg Met Pro Val Cys Ser Thr Glu Asn Pro
290 295 300

Gln Ala Arg Arg Ala Leu Cys Ala Ala Ala Arg Pro Ser Arg Gly Arg
305 310 315 320

Gly Arg Cys Ser Gly Asp Leu
325



<210> 373 
<211> 1056
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 373 

atgacgaatt ttctggacgt gacggtggca gcggttcagg cggctcccgt ctatttcgat 60
cgggaggcat cgacggagaa agcgtgccgg ttgatccacg aagcggcagg gctcggcgcg 120
acgctcgcag cgttcggcga aacctggttg ccagggtatc cgttcttcgt ctgggggttc 180
gcgcacaacc ggagcctgtt ctggcaggcc gccgccgagt acatcgccaa tgcggtggag 240
attccgagtc ccacaacgga ccgtctctgt gcggcggcga aggctgccgg ggtcgacgtc 300
gtcattggcg tcgttgaact ggatgaacga acacgagctt cggtttacag tacgctgctt 360
ttcatcggtc gcgacgggac gatcctgggc cgccaccgca agctgaagcc aacacacatg 420
gagcggacga tctggggcga aggggacgca tatggactcc gcgtctacga acgttcgtac 480
gggcggctga gcggcctgaa ttgctgggaa cacaatatga tgctgcccgg ctacgtgctt 540
gccgcacagg gcacgcagtt tcacgtcgcc gcatggcccg gaaaggagag gctcaccgtc 600
ccgccgaacg aagcggctta tacgcgccag cttctcctct ctcgcgcgta tgcatcccag 660
gccggcgcgt acgtgatcag cgtcgccggg ctgctcgcac cagactccat gcccgagcgt 720
tatcgcgagt tagggcggtc atatgagttg accggcgaca gcgtcatcgt cgacccgcgc 780
ggcgaggtca ttgccgggcc tgcaaaaggc gagaccatcc tgctcgcgca gtgcagtcag 840
gaagctctcc tcgcggccaa gtccgccatc gacctcggcg gccattactc acgcccggat 900
atctttcagc tgcgtgtcaa cgatcaactg cagcatcggg tccggagagt tgagccacac 960
ttcacggcgg cgatcggaca tatcggagcc gagcgccgat cccaggagga tggtactggt 1020
cccttcgacc tggcggaatc tctcacgaac tcctag 1056



<210> 374 
<211> 351 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 374 



Met Thr Asn Phe Leu Asp Val Thr Val Ala Ala Val Gln Ala Ala Pro
1 5 10 15

Val Tyr Phe Asp Arg Glu Ala Ser Thr Glu Lys Ala Cys Arg Leu Ile
20 25 30

His Glu Ala Ala Gly Leu Gly Ala Thr Leu Ala Ala Phe Gly Glu Thr
35 40 45

Trp Leu Pro Gly Tyr Pro Phe Phe Val Trp Gly Phe Ala His Asn Arg
50 55 60

Ser Leu Phe Trp Gln Ala Ala Ala Glu Tyr Ile Ala Asn Ala Val Glu
65 70 75 80

Ile Pro Ser Pro Thr Thr Asp Arg Leu Cys Ala Ala Ala Lys Ala Ala
85 90 95

Gly Val Asp Val Val Ile Gly Val Val Glu Leu Asp Glu Arg Thr Arg
100 105 110

Ala Ser Val Tyr Ser Thr Leu Leu Phe Ile Gly Arg Asp Gly Thr Ile
115 120 125

Leu Gly Arg His Arg Lys Leu Lys Pro Thr His Met Glu Arg Thr Ile
130 135 140

Trp Gly Glu Gly Asp Ala Tyr Gly Leu Arg Val Tyr Glu Arg Ser Tyr
145 150 155 160

Gly Arg Leu Ser Gly Leu Asn Cys Trp Glu His Asn Met Met Leu Pro
165 170 175

Gly Tyr Val Leu Ala Ala Gln Gly Thr Gln Phe His Val Ala Ala Trp
180 185 190

Pro Gly Lys Glu Arg Leu Thr Val Pro Pro Asn Glu Ala Ala Tyr Thr
195 200 205

Arg Gln Leu Leu Leu Ser Arg Ala Tyr Ala Ser Gln Ala Gly Ala Tyr
210 215 220

Val Ile Ser Val Ala Gly Leu Leu Ala Pro Asp Ser Met Pro Glu Arg
225 230 235 240

Tyr Arg Glu Leu Gly Arg Ser Tyr Glu Leu Thr Gly Asp Ser Val Ile
245 250 255

Val Asp Pro Arg Gly Glu Val Ile Ala Gly Pro Ala Lys Gly Glu Thr
260 265 270

Ile Leu Leu Ala Gln Cys Ser Gln Glu Ala Leu Leu Ala Ala Lys Ser
275 280 285

Ala Ile Asp Leu Gly Gly His Tyr Ser Arg Pro Asp Ile Phe Gln Leu
290 295 300

Arg Val Asn Asp Gln Leu Gln His Arg Val Arg Arg Val Glu Pro His
305 310 315 320

Phe Thr Ala Ala Ile Gly His Ile Gly Ala Glu Arg Arg Ser Gln Glu
325 330 335

Asp Gly Thr Gly Pro Phe Asp Leu Ala Glu Ser Leu Thr Asn Ser
340 345 350





<210> 375 
<211> 939 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 375 

atgaccaaaa tcgctgtcat tcaagaaccc ccggtctatc tgaatctgag caaatcgatg 60 

gaccgagcgg tcgacttgat tgccgatgcc gcgagccagg ggtgtcagtt gattgtgttt 120

cccgaagcct ggcttgcagg ttaccccacc ttcgtctggc gtcttgcgcc gggcagcgga 180

atgggaaaaa ccgatgagct ttacgcgcgt ttgctcgcca actcggtcga ccgcagcaaa 240

gaggggcttc ggcctttgca ggaggctgca aaagagcatg gcgttgtcat tgttctgggt 300

tatcaagagg tggacggctc gggcagcagc agcacaatct tcaacagctg cgcgattatt 360

gatgccgacg ggcgactggc caacaaccat cgcaagttga tgcccaccaa tgcggagcgg 420

atggtgtggg ggtttggcga cggttcgggc ctgaacgttg ttgacaccgc ggtgggcagg 480

atcggcacgc tgatttgctg ggaaaactac atgcccttgg cgcgctacgc gctgtatgcc 540

caaaacatcg aaatctatgt ggcgccgacc tgggacagcg gtgccatgtg gcaagccacc 600

ctgcaacata tcgcacgtga aggtggctgc tgggtcatcg gatgtgcaac ctcgctgcaa 660

gcctctgaca tcccggacga ccttccccat cgggatgagt tattcccgaa caaagacgaa 720

tgggtgaacc ctggcgatgc ggtggtttac aaaccttttg gcggccttgt ggccggcccc 780

atgcatcagg aaaaggggct tctcatcgca gagttggacg tcgccgctgt tcaggtctca 840

cgtcggaagt tcgatgcgac cgggcattac gctcgccccg atgtcttcca actgcacgtg 900 

aatcgcagcg cgatgcggcc ggttgagttc acgaattag 939 

<210> 376 
<211> 312 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 376 

Met Thr Lys Ile Ala Val Ile Gln Glu Pro Pro Val Tyr Leu Asn Leu
1 5 10 15

Ser Lys Ser Met Asp Arg Ala Val Asp Leu Ile Ala Asp Ala Ala Ser
20 25 30

Gln Gly Cys Gln Leu Ile Val Phe Pro Glu Ala Trp Leu Ala Gly Tyr
35 40 45

Pro Thr Phe Val Trp Arg Leu Ala Pro Gly Ser Gly Met Gly Lys Thr
50 55 60

Asp Glu Leu Tyr Ala Arg Leu Leu Ala Asn Ser Val Asp Arg Ser Lys
65 70 75 80

Glu Gly Leu Arg Pro Leu Gln Glu Ala Ala Lys Glu His Gly Val Val
85 90 95

Ile Val Leu Gly Tyr Gln Glu Val Asp Gly Ser Gly Ser Ser Ser Thr
100 105 110

Ile Phe Asn Ser Cys Ala Ile Ile Asp Ala Asp Gly Arg Leu Ala Asn
115 120 125

Asn His Arg Lys Leu Met Pro Thr Asn Ala Glu Arg Met Val Trp Gly
130 135 140

Phe Gly Asp Gly Ser Gly Leu Asn Val Val Asp Thr Ala Val Gly Arg
145 150 155 160

Ile Gly Thr Leu Ile Cys Trp Glu Asn Tyr Met Pro Leu Ala Arg Tyr
165 170 175

Ala Leu Tyr Ala Gln Asn Ile Glu Ile Tyr Val Ala Pro Thr Trp Asp
180 185 190

Ser Gly Ala Met Trp Gln Ala Thr Leu Gln His Ile Ala Arg Glu Gly
195 200 205

Gly Cys Trp Val Ile Gly Cys Ala Thr Ser Leu Gln Ala Ser Asp Ile
210 215 220

Pro Asp Asp Leu Pro His Arg Asp Glu Leu Phe Pro Asn Lys Asp Glu
225 230 235 240

Trp Val Asn Pro Gly Asp Ala Val Val Tyr Lys Pro Phe Gly Gly Leu
245 250 255

Val Ala Gly Pro Met His Gln Glu Lys Gly Leu Leu Ile Ala Glu Leu
260 265 270

Asp Val Ala Ala Val Gln Val Ser Arg Arg Lys Phe Asp Ala Thr Gly
275 280 285

His Tyr Ala Arg Pro Asp Val Phe Gln Leu His Val Asn Arg Ser Ala
290 295 300

Met Arg Pro Val Glu Phe Thr Asn
305 310

<210> 377
<211> 1050
<212> DNA
<213> Unknown



<220> 

<223> Obtained from an environmental sample 



<400> 377 

atgtccagca ccacccatcc ccgcctgcgc gtcgccgcgg tgcaagccgc ccccgtcttc 60
ctcgacctcg acggcaccat cgacaagacc atcgacctga tggcccgggc ggccggccag 120
ggcgtgcagc tgatcgcctt tcccgagacc tgggtgcccg gctatccgtg gtggatctgg 180
ctcgattcgc cggcctgggg catgcagttc gtgcagcgct accacgacaa cgccctggtc 240
gtcggctcgc ccgagttcga ccgcattcgc gaggccgcgc gcaagcaccg catctgggtc 300
tcgctcggct acagcgagaa ggccgccggc agcctctaca tcgcccaggc gctgatcgac 360
gaccagggca acacgctgca gactcggcgc aagctcaagc cgacgcacgt ggagcgcacc 420
gtgttcggcg agggcgacgg atcggacctg agcgtggtcg agacggctat cggcaacatc 480
ggctcgctgt cgtgctggga gcacctgcag ccgctcagca agtacgcgat gtactcgcag 540
aacgagcaga tccattgcgg cgcctggccc agcttctcgc tctaccgcgg cggcgcctac 600
gcgctcggcg ccgaagtgaa caacgccgcc agccaggtgt acgcggccga gggccagtgc 660
ttcgtgatcg cgccctgcgc cacggtctcg aaggcgatgc acgaactgct gtgcaccgac 720
cctggcaagc agcagatgct gctggtcggc ggcggcttcg cgcgcatcta cggacccgac 780
ggatcgccgc tcggcaagaa cctggcagag gacgaggaag ggctggtggt ggccgacatc 840
gacctcggca tgatctccct ggccaaggcg gccggcgacc cggccggcca ctattcgcgg 900
cccgacgtga cgcagttgct gttcaacagg aagcggcgcg agccggtggt gctgcaaggc 960
ccggccgagc ccgagaaggc ggtcgccgag ccggtgtcca cgccgagcga agcggcggcg 1020
gccgccgccc gccagccggt cgtggcctga 1050

<210> 378 
<211> 349 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 378 



Met Ser Ser Thr Thr His Pro Arg Leu Arg Val Ala Ala Val Gln Ala
1 5 10 15

Ala Pro Val Phe Leu Asp Leu Asp Gly Thr Ile Asp Lys Thr Ile Asp
20 25 30

Leu Met Ala Arg Ala Ala Gly Gln Gly Val Gln Leu Ile Ala Phe Pro
35 40 45

Glu Thr Trp Val Pro Gly Tyr Pro Trp Trp Ile Trp Leu Asp Ser Pro
50 55 60

Ala Trp Gly Met Gln Phe Val Gln Arg Tyr His Asp Asn Ala Leu Val
65 70 75 80

Val Gly Ser Pro Glu Phe Asp Arg Ile Arg Glu Ala Ala Arg Lys His
85 90 95

Arg Ile Trp Val Ser Leu Gly Tyr Ser Glu Lys Ala Ala Gly Ser Leu
100 105 110

Tyr Ile Ala Gln Ala Leu Ile Asp Asp Gln Gly Asn Thr Leu Gln Thr
115 120 125

Arg Arg Lys Leu Lys Pro Thr His Val Glu Arg Thr Val Phe Gly Glu
130 135 140

Gly Asp Gly Ser Asp Leu Ser Val Val Glu Thr Ala Ile Gly Asn Ile
145 150 155 160

Gly Ser Leu Ser Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr Ala
165 170 175

Met Tyr Ser Gln Asn Glu Gln Ile His Cys Gly Ala Trp Pro Ser Phe
180 185 190

Ser Leu Tyr Arg Gly Gly Ala Tyr Ala Leu Gly Ala Glu Val Asn Asn
195 200 205

Ala Ala Ser Gln Val Tyr Ala Ala Glu Gly Gln Cys Phe Val Ile Ala
210 215 220

Pro Cys Ala Thr Val Ser Lys Ala Met His Glu Leu Leu Cys Thr Asp
225 230 235 240

Pro Gly Lys Gln Gln Met Leu Leu Val Gly Gly Gly Phe Ala Arg Ile
245 250 255




Tyr 


Gly 


Pro 


Asp 


Gly 


Ser 


Pro 


Leu 


Gly 


Lys 


Asn 


Leu 


Ala 


Glu 


Asp 


Glu 








2 60 










265 










270 






Glu 


Gly 


Leu 


Val 


Val 


Ala 


Asp 


He 


Asp 


Leu 


Gly Met 


He 


Ser 


Leu 


Ala 






275 










280 










285 








Lys 


Ala 


Ala 


Gly 


Asp 


Pro 


Ala 


Gly 


His 


Tyr 


Ser 


Arg 


Pro 


Asp 


Val 


Thr 




290 










295 










300 










Gin 


Leu 


Leu 


Phe 


Asn 


Arg 


Lys 


Arg 


Arg 


Glu 


Pro 


Val 


Val 


Leu 


Gin 


Gly 


305 










310 










315 










320 


Pro 


Ala 


Glu 


Pro 


Glu 


Lys 


Ala 


Val 


Ala 


Glu 


Pro 


Val 


Ser 


Thr 


Pro 


Ser 










325 










330 










335 




Glu 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Arg 


Gin 


Pro 


Val 


Val 


Ala 














340 










345 

















<210> 379 
<211> 936 
<212> DNA 
<213> Unknown 
<220>

<223> Obtained from an environmental sample 



<400> 379 

atggctaaaa tagcgattgt acaaaaggcg tctgttacct tgaataaaca agaaactgtc 60

gccagtgctg taagagaggt agagctggca gcggcggaag gtgccgagct ggtagtgttt 120

actgaggcat ttattgctgg gtatccggcc tggatctggc gtttgcggcc aggtggtgac 180

tgggggctat ctgaagatct tcattcccgt ttgttgacaa gcgccgtaga cctgggtggt 240

gatgacctgg atccacttta tgccgcagct aaagaaaata acgtgacgat agtgtgcggt 300

attaatgaac gtgataaccg gctcagtaag gcaacgctat ataattctat cgttattatt 360

ggttccgatg gttcattgtt aaatcgacat cgtaagttga tgccgacgaa tccggagaga 420

atggtatggg gctttggtga tgcctctggt ctgaaggtcg ttgatacccc cgttggtcgt 480

gttggtacgc ttgtctgttg ggaaaactat atgcccttgg ccagatatgc gttgtattcg 540

cagggggtag aggtttatat tgcgccgacc tacgatagcg gtgatgactg gatttctaca 600

ttacagcata ttgccaggga gggtcgttgt tgggttgttg gctgtggcaa tctattgcgt 660

ggcagcgata taccggatga cttccctgag aagttggcgt tatacccaga tgaggatgag 720

tggataaatc ctggggattc cgttgtgatt gcacctgggg gtaaaatcat ggccgggcca 780

ttgcgccagg aggcggggat tgtctattgt gatattgcgt ctgaaagtgc cagtcaggca 840

aaacgtgcgc tggatgtggc tggacattat tcccggcctg atatctttga gttgcatgtg 900

aatacgaagg tgcagacccc ggttgtatat gattag 936



<210> 380 
<211> 311 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 380 



Met Ala Lys Ile Ala Ile Val Gln Lys Ala Ser Val Thr Leu Asn Lys
1 5 10 15

Gln Glu Thr Val Ala Ser Ala Val Arg Glu Val Glu Leu Ala Ala Ala
20 25 30

Glu Gly Ala Glu Leu Val Val Phe Thr Glu Ala Phe Ile Ala Gly Tyr
35 40 45

Pro Ala Trp Ile Trp Arg Leu Arg Pro Gly Gly Asp Trp Gly Leu Ser
50 55 60

Glu Asp Leu His Ser Arg Leu Leu Thr Ser Ala Val Asp Leu Gly Gly
65 70 75 80

Asp Asp Leu Asp Pro Leu Tyr Ala Ala Ala Lys Glu Asn Asn Val Thr
85 90 95

Ile Val Cys Gly Ile Asn Glu Arg Asp Asn Arg Leu Ser Lys Ala Thr
100 105 110

Leu Tyr Asn Ser Ile Val Ile Ile Gly Ser Asp Gly Ser Leu Leu Asn
115 120 125

Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val Trp Gly
130 135 140

Phe Gly Asp Ala Ser Gly Leu Lys Val Val Asp Thr Pro Val Gly Arg
145 150 155 160

Val Gly Thr Leu Val Cys Trp Glu Asn Tyr Met Pro Leu Ala Arg Tyr
165 170 175

Ala Leu Tyr Ser Gln Gly Val Glu Val Tyr Ile Ala Pro Thr Tyr Asp
180 185 190

Ser Gly Asp Asp Trp Ile Ser Thr Leu Gln His Ile Ala Arg Glu Gly
195 200 205

Arg Cys Trp Val Val Gly Cys Gly Asn Leu Leu Arg Gly Ser Asp Ile
210 215 220

Pro Asp Asp Phe Pro Glu Lys Leu Ala Leu Tyr Pro Asp Glu Asp Glu
225 230 235 240

Trp Ile Asn Pro Gly Asp Ser Val Val Ile Ala Pro Gly Gly Lys Ile
245 250 255

Met Ala Gly Pro Leu Arg Gln Glu Ala Gly Ile Val Tyr Cys Asp Ile
260 265 270

Ala Ser Glu Ser Ala Ser Gln Ala Lys Arg Ala Leu Asp Val Ala Gly
275 280 285

His Tyr Ser Arg Pro Asp Ile Phe Glu Leu His Val Asn Thr Lys Val
290 295 300

Gln Thr Pro Val Val Tyr Asp
305 310

<210> 381 

<211> 945 

<212> DNA 

<213> Clostridium acetobutylicum ATCC 3625 



<400> 381 

atgggcgaat tcggtgaagt taccctgggc gtggctcagg ctgctcccgt gtactttgac 60

cgcgaggcct cgaccgagaa agcctgtggc ctgatccgcg aggcgggcga aaagggtgta 120

gatcttctgg cgttcggtga aacgtggtta accggctatc catactggaa agatgcgcct 180

tggtctcgcg aatacaacga cttgcgtgca cgttatgttg cgaacggtgt gatgattcct 240

ggtccggaaa cggacgctct gtgccaagca gccgcggaag caggtgtgga tgtggcgatc 300

ggagtagtag aactggagcc gggctctctt tcctcggttt attgcactct gttatttata 360

agccgcgagg gcgagatcct gggtcgtcac cgcaaactga aaccgaccga tagcgaacgt 420

cgttactggt ctgaaggcga cgcgactggt ctgcgcgttt atgagcgccc atatggtcgg 480

cttagcggcc tgaattgctg ggagcacact atgatgctgc cggggtacgc cctggcggcg 540

cagggcaccc agttccatgt ggccgcttgg ccaaacatgg catcctcgaa ttctgaactt 600

ctgtctcgtg cctacgctat gcaggcgggc tgctacgttt tatgcgcggg tggcctgggc 660

ccggccccag gtgaactgcc ggatggtatc gcggcggaaa gtttagatca cttgactgga 720

gagtcatgta tcatcgaccc gtgggggaaa gtaattgctg gtccggtgtc ttgcgaggaa 780

acccttatca cggctcgcgt tagcaccgca tctatttatc gccgcaaaag tttgacggac 840

gtgggcggtc attatagccg cccggatgtt ttccgttttg aagtcgatcg ctctgagcgt 900

ccccgtgtcg tgtttcgcga tggtgacgtg gatgaccgag gttaa 945



<210> 382 
<211> 314 
<212> PRT 

<213> Clostridium acetobutylicum ATCC 3625



<400> 382 
























Met Gly Glu Phe Gly Glu Val Thr Leu Gly Val Ala Gln Ala Ala Pro
1 5 10 15

Val Tyr Phe Asp Arg Glu Ala Ser Thr Glu Lys Ala Cys Gly Leu Ile
20 25 30

Arg Glu Ala Gly Glu Lys Gly Val Asp Leu Leu Ala Phe Gly Glu Thr
35 40 45

Trp Leu Thr Gly Tyr Pro Tyr Trp Lys Asp Ala Pro Trp Ser Arg Glu
50 55 60

Tyr Asn Asp Leu Arg Ala Arg Tyr Val Ala Asn Gly Val Met Ile Pro
65 70 75 80

Gly Pro Glu Thr Asp Ala Leu Cys Gln Ala Ala Ala Glu Ala Gly Val
85 90 95

Asp Val Ala Ile Gly Val Val Glu Leu Glu Pro Gly Ser Leu Ser Ser
100 105 110

Val Tyr Cys Thr Leu Leu Phe Ile Ser Arg Glu Gly Glu Ile Leu Gly
115 120 125

Arg His Arg Lys Leu Lys Pro Thr Asp Ser Glu Arg Arg Tyr Trp Ser
130 135 140

Glu Gly Asp Ala Thr Gly Leu Arg Val Tyr Glu Arg Pro Tyr Gly Arg
145 150 155 160

Leu Ser Gly Leu Asn Cys Trp Glu His Thr Met Met Leu Pro Gly Tyr
165 170 175

Ala Leu Ala Ala Gln Gly Thr Gln Phe His Val Ala Ala Trp Pro Asn
180 185 190

Met Ala Ser Ser Asn Ser Glu Leu Leu Ser Arg Ala Tyr Ala Met Gln
195 200 205

Ala Gly Cys Tyr Val Leu Cys Ala Gly Gly Leu Gly Pro Ala Pro Gly
210 215 220

Glu Leu Pro Asp Gly Ile Ala Ala Glu Ser Leu Asp His Leu Thr Gly
225 230 235 240

Glu Ser Cys Ile Ile Asp Pro Trp Gly Lys Val Ile Ala Gly Pro Val
245 250 255

Ser Cys Glu Glu Thr Leu Ile Thr Ala Arg Val Ser Thr Ala Ser Ile
260 265 270

Tyr Arg Arg Lys Ser Leu Thr Asp Val Gly Gly His Tyr Ser Arg Pro
275 280 285

Asp Val Phe Arg Phe Glu Val Asp Arg Ser Glu Arg Pro Arg Val Val
290 295 300

Phe Arg Asp Gly Asp Val Asp Asp Arg Gly
305 310



<210> 383 
<211> 1041 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 383 

atgtcggagc ccatgacgaa gtatcgcggc gcggcggtgc aggccgcgcc ggtgttcctc 60

gatctcgacc gcacagtcga gaaagcgatc ggcctgatcg agcaggcggc caagcaggac 120

gtgcgcctga tcgcattccc agagacttgg attcccggct atcccttttg gatatggctg 180

ggcgcgccgg cttggggcat gcgcttcgtc cagcgctatt tcgagaattc gctcgtgcgc 240

ggcagcaagc agtggcaggc cctggcggat gcggcccgcc gccacggcat gcatgtcgtg 300

gccggctata gcgagcgcgc gggcggcagc ctctatatgg gccaggcgat cttcggcccc 360

gatggcgatc tgatcgccgc gcgccgcaag ctcaagccta cccatgcgga gcgcaccgtg 420

ttcggcgagg gagacggcag ccatctcgcg gtgcacgata ccgccatcgg gcgcctcggc 480

gcgctctgtt gctgggagca catccagcca ttgtcgaaat acgccatgta cgccgccgac 540

gaacaggtcc acgtcgcgtc gtggccgagc ttcagcctct atcgcggcat ggcctatgcg 600

ctcggaccgg aggtcaatac cgccgcaagc cagatctacg cggtcgaggg cggctgctac 660

gtgctggcgt cgtgcgcgac cgtttcgccg gagatgatca aggtattggt ggatacgccc 720

gacaaggaga tgttcctcaa ggccggcggc ggttttgcca tgattttcgg gcccgacggc 780

cgcgccctgg ccgagccgct cccggagacc gaagagggac tgctggtcgc cgatatcgac 840

ctcggcatga tcgcgttggc caaggcggcg gccgatccgg cgggccacta ttcacggccc 900

gacgtaacgc ggctgctgct ggatcgacgt ccggcccaac gcgtcgtcac gcttgatgcc 960

gcattcgaac cgcaaaacga ggacaagggc gacgcgcccg cgctgcgcgt ggtggcggaa 1020

agcgccgccg ccgcgcagta g 1041



<210> 384 
<211> 346 
<212> PRT 
<213> Unknown



<220> 

<223> Obtained from an environmental sample 
<400> 384 

Met Ser Glu Pro Met Thr Lys Tyr Arg Gly Ala Ala Val Gln Ala Ala
1 5 10 15

Pro Val Phe Leu Asp Leu Asp Arg Thr Val Glu Lys Ala Ile Gly Leu
20 25 30

Ile Glu Gln Ala Ala Lys Gln Asp Val Arg Leu Ile Ala Phe Pro Glu
35 40 45

Thr Trp Ile Pro Gly Tyr Pro Phe Trp Ile Trp Leu Gly Ala Pro Ala
50 55 60

Trp Gly Met Arg Phe Val Gln Arg Tyr Phe Glu Asn Ser Leu Val Arg
65 70 75 80

Gly Ser Lys Gln Trp Gln Ala Leu Ala Asp Ala Ala Arg Arg His Gly
85 90 95

Met His Val Val Ala Gly Tyr Ser Glu Arg Ala Gly Gly Ser Leu Tyr
100 105 110

Met Gly Gln Ala Ile Phe Gly Pro Asp Gly Asp Leu Ile Ala Ala Arg
115 120 125

Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Val Phe Gly Glu Gly
130 135 140

Asp Gly Ser His Leu Ala Val His Asp Thr Ala Ile Gly Arg Leu Gly
145 150 155 160

Ala Leu Cys Cys Trp Glu His Ile Gln Pro Leu Ser Lys Tyr Ala Met
165 170 175

Tyr Ala Ala Asp Glu Gln Val His Val Ala Ser Trp Pro Ser Phe Ser
180 185 190

Leu Tyr Arg Gly Met Ala Tyr Ala Leu Gly Pro Glu Val Asn Thr Ala
195 200 205

Ala Ser Gln Ile Tyr Ala Val Glu Gly Gly Cys Tyr Val Leu Ala Ser
210 215 220

Cys Ala Thr Val Ser Pro Glu Met Ile Lys Val Leu Val Asp Thr Pro
225 230 235 240

Asp Lys Glu Met Phe Leu Lys Ala Gly Gly Gly Phe Ala Met Ile Phe
245 250 255

Gly Pro Asp Gly Arg Ala Leu Ala Glu Pro Leu Pro Glu Thr Glu Glu
260 265 270

Gly Leu Leu Val Ala Asp Ile Asp Leu Gly Met Ile Ala Leu Ala Lys
275 280 285

Ala Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val Thr Arg
290 295 300

Leu Leu Leu Asp Arg Arg Pro Ala Gln Arg Val Val Thr Leu Asp Ala
305 310 315 320

Ala Phe Glu Pro Gln Asn Glu Asp Lys Gly Asp Ala Pro Ala Leu Arg
325 330 335

Val Val Ala Glu Ser Ala Ala Ala Ala Gln
340 345



<210> 385 
<211> 1014 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 385 

atgaaagaag ctatcaaggt cgcctgcgtg caagccgccc cgatctacat ggatttgaag 60

gcgacggtgg acaaaaccat tgagttgatg gaagaagcag cacgtaataa tgctcgtctg 120

atcgcctttc cggaaacttg gattccaggc tacccatggt ttctttggct tgactcacca 180

gcatgggcaa tgcaatttgt acgccaatac catgagaact cattggagtt ggatggccct 240

caagctaagc gcatttcaga tgcagccaag cggttgggaa tcatggtcac cctggggatg 300

agtgaacggg tcggtggcac cctttacatc agtcagtggt tcataggcga taatggtgac 360

accattgggg cccggcgaaa gttgaaacct acttttgttg aacgtacttt gttcggcgaa 420

ggggatggtt catcgctagc ggttttcgag acgtctgttg gaaggctggg tggcttatgc 480

tgttgggagc accttcaacc gctaacaaaa tacgctttgt atgcacaaaa tgaagagatt 540

cattgtgcgg cttggccgag ctttagcctt tatcctaatg cggcgaaagc cctggggcct 600

gatgtcaatg tagcggcctc tcgaatctat gccgttgaag ggcaatgctt cgtactagcg 660

tcgtgtgcgc tcgtttcaca atccatgatc gatatgcttt gtacagatga cgaaaagcat 720

gcgttgcttc tggctggtgg tggacactca cgtatcatag ggcctgatgg tggtgacttg 780

gtcgcgcctc ttgccgaaaa tgaagagggt attctctacg caaaccttga tcctggagta 840

cgcatccttg ctaaaatggc ggcagaccct gctggtcatt attcccgtcc cgacattact 900

cgcttgctaa tagatcgcag ccctaaatta ccggtagttg aaattgaagg tgatcttcgt 960

ccttacgctt tgggtaaagc gtctgagacg ggtgcgcaac tcgaagaaat ttga 1014

<210> 386 
<211> 337 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 386 




























Met Lys Glu Ala Ile Lys Val Ala Cys Val Gln Ala Ala Pro Ile Tyr
1 5 10 15

Met Asp Leu Lys Ala Thr Val Asp Lys Thr Ile Glu Leu Met Glu Glu
20 25 30

Ala Ala Arg Asn Asn Ala Arg Leu Ile Ala Phe Pro Glu Thr Trp Ile
35 40 45

Pro Gly Tyr Pro Trp Phe Leu Trp Leu Asp Ser Pro Ala Trp Ala Met
50 55 60

Gln Phe Val Arg Gln Tyr His Glu Asn Ser Leu Glu Leu Asp Gly Pro
65 70 75 80

Gln Ala Lys Arg Ile Ser Asp Ala Ala Lys Arg Leu Gly Ile Met Val
85 90 95

Thr Leu Gly Met Ser Glu Arg Val Gly Gly Thr Leu Tyr Ile Ser Gln
100 105 110

Trp Phe Ile Gly Asp Asn Gly Asp Thr Ile Gly Ala Arg Arg Lys Leu
115 120 125

Lys Pro Thr Phe Val Glu Arg Thr Leu Phe Gly Glu Gly Asp Gly Ser
130 135 140

Ser Leu Ala Val Phe Glu Thr Ser Val Gly Arg Leu Gly Gly Leu Cys
145 150 155 160

Cys Trp Glu His Leu Gln Pro Leu Thr Lys Tyr Ala Leu Tyr Ala Gln
165 170 175

Asn Glu Glu Ile His Cys Ala Ala Trp Pro Ser Phe Ser Leu Tyr Pro
180 185 190

Asn Ala Ala Lys Ala Leu Gly Pro Asp Val Asn Val Ala Ala Ser Arg
195 200 205

Ile Tyr Ala Val Glu Gly Gln Cys Phe Val Leu Ala Ser Cys Ala Leu
210 215 220

Val Ser Gln Ser Met Ile Asp Met Leu Cys Thr Asp Asp Glu Lys His
225 230 235 240

Ala Leu Leu Leu Ala Gly Gly Gly His Ser Arg Ile Ile Gly Pro Asp
245 250 255

Gly Gly Asp Leu Val Ala Pro Leu Ala Glu Asn Glu Glu Gly Ile Leu
260 265 270

Tyr Ala Asn Leu Asp Pro Gly Val Arg Ile Leu Ala Lys Met Ala Ala
275 280 285

Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Ile Thr Arg Leu Leu Ile
290 295 300

Asp Arg Ser Pro Lys Leu Pro Val Val Glu Ile Glu Gly Asp Leu Arg
305 310 315 320

Pro Tyr Ala Leu Gly Lys Ala Ser Glu Thr Gly Ala Gln Leu Glu Glu
325 330 335

Ile